Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_1462 |
Symbol | |
ID | 4252040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 1710416 |
End bp | 1712125 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 638118061 |
Product | anthranilate synthase component I |
Protein accession | YP_733597 |
Protein GI | 113969804 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00565] anthranilate synthase component I, proteobacterial subset |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCTAA AGACATTTAA TCAGGTTAAC CAAGTTGATG GCGAAAATGT TGCCTCGTCT CAACAGACAT TCGCACGTTC ACATACGCTC AAGGCTGCGC TGGCATACCA TAGCGACCCA CTGCGTCTGT ACCAGCACAT CACTCAAGAT GCGCCCCATA CCATGTTGTT AGAATCGGCG GAAATCGACA GCAAGGAAAA TCTTAAGAGC ATGGTGATGA CCCATGCGGC GCTGATGATC CGCTGCGACG GTTATCGTTT ACGCTTTAGC GCACTGAGTG ACAATGGCAT CAGTTTACTC TCCCCCATCG AGCAATTTTT TACTGCTCGC GCAAGCCAAA CTCTATGCCA ACGCGATGGC CACAACTTAG TGGTCACGCT GCAAAAGGAT ACCGAGCTTA AGGATGAAGA TGCGCGCTTA AAATCCACCT CACCCCTCGA TGGTTTGCGC TTGTTTGTAA AGCATATCGA CTGCGGCGCT CATACTGACA GCCAATCCAA GCCAGCATTC GAAGACTTAT TTTTAGGTGG CGTGCTCGCC TACGATTTGA TTGATACCGT CGAGCCACTG CCTGAAGCAC CAAATGGTGC AAATGATTGT CCTGATTATT TATTTTATCT CGCCGAAACC TTAATTCTTA TCGATCACAA GCAAAAGCAC GCCGAGCTAA TCACCCACCA CTTCAGTGAA GGCACAGAAA AGCATTCCGT TGTGACCCAA GCCTTAGCCG AGCGAGCAGA GAATATCCGC GCCCAATGTG AAGCCTTAGC CAAGAGTGCA ACACCTGCAC CCGCCCTCGT TGGCATAACG GCCACAGAGC AAGTGAATGT CAGTGATGAG GACTTCAAGC AAACTGTTAT CGATTTGAAA GAACACATTA TTGCCGGCGA CATCTTCCAA GTCGTGCCTT CACGCAGCTT TAGCCTGCCC TGCCCCAATA CCTTAGGCGC TTACCGCGCG CTGCGATTAA CCAACCCCAG CCCCTATATG TTTTATTTCA GGGGCCATGA CTTCACCCTT TTGGGCGCCT CGCCCGAGAG CGCGCTGAAA TATGAAGCCA GCAACAATCA AGTCGAAGTC TACCCGATTG CAGGCACCCG TAAGCGCGGC AAAACCGCCA GCGGCGAGAT TGATTTCGAC CTCGATAGCC GTATCGAACT CGAATTGCGT TTGGATAAAA AAGAGCTTTC CGAACACTTA ATGTTGGTCG ATTTAGCCCG TAACGATATT GCCCGAATCA GCCAAAGCGG CAGTCGCAAA GTGGCCGAGT TACTTAAAGT CGACCGTTAT TCCCACGTGA TGCACTTAGT CAGCCGCGTC ACAGGTCAAC TACGCCAAGA CTTAGATGCA CTCCACGCCT ACCAAGCCTG CATGAATATG GGCACGCTAG TTGGCGCCCC GAAAGTGCGC GCCTCGCAAC TCGTACGTCA GGCAGAAAAG ACCCGCCGAG GCAGCTATGG TGGCGCAGTG GGTTACCTCA ACGCCCTTGG GGATATGGAC ACCTGTATTG TTATCCGCTC CGCCTTTGTG AAAAACGGCG TGGCCCATAT CCAAGCGGGT GCTGGCGTAG TGTTTGATTC CGATCCCCAG AGTGAAGCCG ATGAAACGCG CCAAAAGGCG CAGGCGGTGA TTTCTGCCAT CAAGATGGGC GCTGGCTTAG ATAAAAGCCA TCAAGCAACT GCGACTACGA CGACTGAACA ACCAAGATAA
|
Protein sequence | MTLKTFNQVN QVDGENVASS QQTFARSHTL KAALAYHSDP LRLYQHITQD APHTMLLESA EIDSKENLKS MVMTHAALMI RCDGYRLRFS ALSDNGISLL SPIEQFFTAR ASQTLCQRDG HNLVVTLQKD TELKDEDARL KSTSPLDGLR LFVKHIDCGA HTDSQSKPAF EDLFLGGVLA YDLIDTVEPL PEAPNGANDC PDYLFYLAET LILIDHKQKH AELITHHFSE GTEKHSVVTQ ALAERAENIR AQCEALAKSA TPAPALVGIT ATEQVNVSDE DFKQTVIDLK EHIIAGDIFQ VVPSRSFSLP CPNTLGAYRA LRLTNPSPYM FYFRGHDFTL LGASPESALK YEASNNQVEV YPIAGTRKRG KTASGEIDFD LDSRIELELR LDKKELSEHL MLVDLARNDI ARISQSGSRK VAELLKVDRY SHVMHLVSRV TGQLRQDLDA LHAYQACMNM GTLVGAPKVR ASQLVRQAEK TRRGSYGGAV GYLNALGDMD TCIVIRSAFV KNGVAHIQAG AGVVFDSDPQ SEADETRQKA QAVISAIKMG AGLDKSHQAT ATTTTEQPR
|
| |