Gene Shewmr4_1462 details

Gene Information

Locus tag: Shewmr4_1462
Symbol:
ID: 4252040
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 1710416
End bp: 1712125
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 52%
IMG OID: 638118061
Product: anthranilate synthase component I
Protein accession: YP_733597
Protein GI: 113969804
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00565] anthranilate synthase component I, proteobacterial subset
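
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the source database.

```python
# Sanity check of the reported lengths, assuming 1-based inclusive
# coordinates and a terminal stop codon (both are assumptions).

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1

def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1

length_bp = gene_length(1710416, 1712125)
print(length_bp)                  # 1710
print(protein_length(length_bp))  # 569
```

Both values agree with the Gene Length and Protein Length fields listed in the record.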


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCTAA AGACATTTAA TCAGGTTAAC CAAGTTGATG GCGAAAATGT TGCCTCGTCT 
CAACAGACAT TCGCACGTTC ACATACGCTC AAGGCTGCGC TGGCATACCA TAGCGACCCA
CTGCGTCTGT ACCAGCACAT CACTCAAGAT GCGCCCCATA CCATGTTGTT AGAATCGGCG
GAAATCGACA GCAAGGAAAA TCTTAAGAGC ATGGTGATGA CCCATGCGGC GCTGATGATC
CGCTGCGACG GTTATCGTTT ACGCTTTAGC GCACTGAGTG ACAATGGCAT CAGTTTACTC
TCCCCCATCG AGCAATTTTT TACTGCTCGC GCAAGCCAAA CTCTATGCCA ACGCGATGGC
CACAACTTAG TGGTCACGCT GCAAAAGGAT ACCGAGCTTA AGGATGAAGA TGCGCGCTTA
AAATCCACCT CACCCCTCGA TGGTTTGCGC TTGTTTGTAA AGCATATCGA CTGCGGCGCT
CATACTGACA GCCAATCCAA GCCAGCATTC GAAGACTTAT TTTTAGGTGG CGTGCTCGCC
TACGATTTGA TTGATACCGT CGAGCCACTG CCTGAAGCAC CAAATGGTGC AAATGATTGT
CCTGATTATT TATTTTATCT CGCCGAAACC TTAATTCTTA TCGATCACAA GCAAAAGCAC
GCCGAGCTAA TCACCCACCA CTTCAGTGAA GGCACAGAAA AGCATTCCGT TGTGACCCAA
GCCTTAGCCG AGCGAGCAGA GAATATCCGC GCCCAATGTG AAGCCTTAGC CAAGAGTGCA
ACACCTGCAC CCGCCCTCGT TGGCATAACG GCCACAGAGC AAGTGAATGT CAGTGATGAG
GACTTCAAGC AAACTGTTAT CGATTTGAAA GAACACATTA TTGCCGGCGA CATCTTCCAA
GTCGTGCCTT CACGCAGCTT TAGCCTGCCC TGCCCCAATA CCTTAGGCGC TTACCGCGCG
CTGCGATTAA CCAACCCCAG CCCCTATATG TTTTATTTCA GGGGCCATGA CTTCACCCTT
TTGGGCGCCT CGCCCGAGAG CGCGCTGAAA TATGAAGCCA GCAACAATCA AGTCGAAGTC
TACCCGATTG CAGGCACCCG TAAGCGCGGC AAAACCGCCA GCGGCGAGAT TGATTTCGAC
CTCGATAGCC GTATCGAACT CGAATTGCGT TTGGATAAAA AAGAGCTTTC CGAACACTTA
ATGTTGGTCG ATTTAGCCCG TAACGATATT GCCCGAATCA GCCAAAGCGG CAGTCGCAAA
GTGGCCGAGT TACTTAAAGT CGACCGTTAT TCCCACGTGA TGCACTTAGT CAGCCGCGTC
ACAGGTCAAC TACGCCAAGA CTTAGATGCA CTCCACGCCT ACCAAGCCTG CATGAATATG
GGCACGCTAG TTGGCGCCCC GAAAGTGCGC GCCTCGCAAC TCGTACGTCA GGCAGAAAAG
ACCCGCCGAG GCAGCTATGG TGGCGCAGTG GGTTACCTCA ACGCCCTTGG GGATATGGAC
ACCTGTATTG TTATCCGCTC CGCCTTTGTG AAAAACGGCG TGGCCCATAT CCAAGCGGGT
GCTGGCGTAG TGTTTGATTC CGATCCCCAG AGTGAAGCCG ATGAAACGCG CCAAAAGGCG
CAGGCGGTGA TTTCTGCCAT CAAGATGGGC GCTGGCTTAG ATAAAAGCCA TCAAGCAACT
GCGACTACGA CGACTGAACA ACCAAGATAA
 
Protein sequence
MTLKTFNQVN QVDGENVASS QQTFARSHTL KAALAYHSDP LRLYQHITQD APHTMLLESA 
EIDSKENLKS MVMTHAALMI RCDGYRLRFS ALSDNGISLL SPIEQFFTAR ASQTLCQRDG
HNLVVTLQKD TELKDEDARL KSTSPLDGLR LFVKHIDCGA HTDSQSKPAF EDLFLGGVLA
YDLIDTVEPL PEAPNGANDC PDYLFYLAET LILIDHKQKH AELITHHFSE GTEKHSVVTQ
ALAERAENIR AQCEALAKSA TPAPALVGIT ATEQVNVSDE DFKQTVIDLK EHIIAGDIFQ
VVPSRSFSLP CPNTLGAYRA LRLTNPSPYM FYFRGHDFTL LGASPESALK YEASNNQVEV
YPIAGTRKRG KTASGEIDFD LDSRIELELR LDKKELSEHL MLVDLARNDI ARISQSGSRK
VAELLKVDRY SHVMHLVSRV TGQLRQDLDA LHAYQACMNM GTLVGAPKVR ASQLVRQAEK
TRRGSYGGAV GYLNALGDMD TCIVIRSAFV KNGVAHIQAG AGVVFDSDPQ SEADETRQKA
QAVISAIKMG AGLDKSHQAT ATTTTEQPR
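
As a rough illustration of how the gene sequence relates to the protein sequence, the following sketch translates space-separated nucleotide blocks codon by codon. It assumes the sequence is given 5' to 3' on the coding strand, in frame from the first base. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in its permitted start codons), so the standard codon-to-residue mapping is used here. The helper name `translate` is hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in TCAG order; table 11 shares
# these assignments and differs only in its alternative start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First and last 30 bases of the gene sequence in this record.
print(translate("ATGACCCTAA AGACATTTAA TCAGGTTAAC"))  # MTLKTFNQVN
print(translate("GCGACTACGA CGACTGAACA ACCAAGATAA"))  # ATTTTEQPR
```

The two outputs match the first ten and last nine residues of the protein sequence listed above.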