Gene Meso_3904 details

Gene Information

Locus tag: Meso_3904
Symbol:
ID: 4181876
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chelativorans sp. BNC1
Kingdom: Bacteria
Replicon accession: NC_008254
Strand:
Start bp: 4206086
End bp: 4207243
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 65%
IMG OID: 638069796
Product: twin-arginine translocation pathway signal
Protein accession: YP_676436
Protein GI: 110636228
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
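
The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration and assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4206086
end_bp = 4207243
gene_length_bp = 1158
protein_length_aa = 385

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS with one terminal stop codon,
# so the residue count is the codon count minus one.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print(span_bp == gene_length_bp)                     # coordinate span vs. Gene Length
print(span_bp % 3 == 0)                              # CDS length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus stop vs. Protein Length
```

Under these assumptions all three checks hold for this record: the span is 1158 bp, which is 386 codons, or 385 residues plus a stop codon.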


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.307988
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAAG AAGACATTTC CACAATCGGC CGCCGTAGTT TTCTCGGCGT GGCGGCAACG 
ATGGCGGTAG CTCCTGTCCT CATTGGCGCC ACTTCCAAGG CGTGGGCGCA GGCTGGCGGC
CCGGATGGCT CCATAGCAGC GTCGTCGAGC AATTCCCGCA ACCTTGGAGC GCTGGAGGTT
TCGAGCGTCG GGCTGGGCGT CCAGAACATG GCCCGTACCT ACCAGACGAC GATCCCGTCG
CGGCCGGAGA TGATCAACAT CATCCGGACG GCCCATGAGC GCGGCGTTAC CTTCTTCGAC
ACCGCCGAGG CCTATGGCCC GCACGAGTGC GAGCGGATTC TCGGCGAAGC TATCGCGCCG
TTCCGCGACG AGGTCGTGAT CACCTCGAAA TTCGGCTGGA ACATCGATCT GGACACCGGC
GAGCGTCGAC CGGGACTCAT CAGCCGCCCG GACCACATCA AGCTCGCGGT CGAAGGCATG
CTCGAACGCC TGCGCACCGA TCGCGTCGAT CTTCTCTACC AGCACCGCGT CGACCCGGAG
GTACCGATCG AGGACGTCGC CGGAGCAATC AAGGACCTCA TGGACGAGGG CAAGGTGCTG
CATTGGGGCC TCTCCGAGAT GGGGCTCAAC ACGTTGCGCC GCGCCCACGC CGCCCTTCCC
GTATCGGCGG TCCAGAGCGA GTATTCGATG CTGTGGCGCG GTCCGGAAGA CGAGGTGCTT
TCCGTTTGCG AGGAACTGGG CATCGGCTTC GTGCCGTGGA GCCCGCTCGG GGTCGGCTTC
TTGACGGGAG CCATCGACGA GCGGACGCGT TTCGCGGAGG GAGACATTCG CGGAATCGAG
TCGCGCTTCT CGCCCGATAA CCTGCCGGCG AACCTGGCGC TCGTGAGATT GCTAGGCGAA
TGGGCCGACA GGAAGCAGGC CACCCGGGCA CAGATCGCGC TGGCATGGCT GATGGCGCAA
AAGCCCTGGA TCGTTCCGAT CCCCGGCACG ACGCAGATGG CGCACTTGCT CGAGAACATC
GGCGCGGCTT CGGTAAGCTT CACGCCCGAG GAAATCGGCG AGCTCAACTC CGCGGTCGCA
GCGATCGAGG TGCGCGGCCA GCGCCTGCCC GACGCCGTGC TGGCATTCTC GAACGTCGAG
GCGCCTGAGA GGAATTGA
 
Protein sequence
MKKEDISTIG RRSFLGVAAT MAVAPVLIGA TSKAWAQAGG PDGSIAASSS NSRNLGALEV 
SSVGLGVQNM ARTYQTTIPS RPEMINIIRT AHERGVTFFD TAEAYGPHEC ERILGEAIAP
FRDEVVITSK FGWNIDLDTG ERRPGLISRP DHIKLAVEGM LERLRTDRVD LLYQHRVDPE
VPIEDVAGAI KDLMDEGKVL HWGLSEMGLN TLRRAHAALP VSAVQSEYSM LWRGPEDEVL
SVCEELGIGF VPWSPLGVGF LTGAIDERTR FAEGDIRGIE SRFSPDNLPA NLALVRLLGE
WADRKQATRA QIALAWLMAQ KPWIVPIPGT TQMAHLLENI GAASVSFTPE EIGELNSAVA
AIEVRGQRLP DAVLAFSNVE APERN