Gene Meso_2940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMeso_2940 
Symbol 
ID4181834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChelativorans sp. BNC1 
KingdomBacteria 
Replicon accessionNC_008254 
Strand
Start bp3197761 
End bp3198861 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content57% 
IMG OID638068824 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_675477 
Protein GI110635269 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.381466 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACTATC TGAACCGCAT GCTGAGCCGC CGGCGCTTTC TTGAGGTATC GGCATTTACA 
GGGGGAGGGC TGTGTTTGTC CGGCGTCGGA GCGTTGGCCC AGGACGAGAT CGCGAGGCTA
TACGAAGATG CAAAGCAAGA GGGCGCGGTA TCATATTACG CGGGCGGCCC CATCGCTCCG
CATCAAGCGG ACATCGAAGC CTTTTCCAAA GAGTTCCCGG GCATCAGCTT CGATCTGAAG
ACCGGCTTTT CAAATCAACT TGTGCCGCCG ATCAACGACC AGATTGCTGC TGGCAAGCTT
GAGGCCGACA TCGCCAACCT GCAGACAATC CAGGACATCG AAGCCTGGAG GCAGGCCGAG
GTCCTCGCCT CCTACCGCAG CCCGAATTTC GATGCGATCC TCGACACGTT CAAGGAAGAG
GACGGCACCA GCGTTGGCGT GCATGTCTAT GGTTTATGCT ATGGTTACAA CCCTAATCTG
GTGGCGGCCG AAGATGTGCC GAAATCGGCG CCTGACTTCC TGGATTCAAA GTTCCGCGGC
AAAATCATTT CCACCTACCC CCAAGACGAC GACATCACGC TCTATCTGTA CTGGACGATC
GTCGACAAAT ATGGCTGGGG ATACATGTCC GAGCTGATGA AAAATCAGCC GCGCTTTATT
CGTGGTCATC TGGGAATCGC GCAGGAGATC GCGGCTGGTA ATGCGGCACT TTCATTCGAC
GCATCGGTAA CCACGATCAC CCGCGCGCAA GCCGCCGGGG GCACAATCGA GGCCGCATTC
TCGCAGACTG ACCCCATGCC TATCTGGGAC AATCGACTCT GCATCTTCAA GGATGCGCCG
CACCCAAATG CCGCGAGACT GTTCCAGACT TGGCTTCTAT CACGCGAACA TCAGATCGCG
GCTGGGCTCT GGTCAACGCG CCGCGACGTC GAACCGGAGG GCGGAAAATT CAAAGGCATC
GACGAGTATT TCACGGCGCA CAATTTCAAG GATTTTATCC TGCAGCCGGC TGACGAGCTC
CAGGAGTTGC GCGTACGTTT TGAGTCCTTT ATCGGGCCCG TTGAGGGCAT GCCGGTGCTT
TCTGCGCCAG CCAAAAACTA G
 
Protein sequence
MHYLNRMLSR RRFLEVSAFT GGGLCLSGVG ALAQDEIARL YEDAKQEGAV SYYAGGPIAP 
HQADIEAFSK EFPGISFDLK TGFSNQLVPP INDQIAAGKL EADIANLQTI QDIEAWRQAE
VLASYRSPNF DAILDTFKEE DGTSVGVHVY GLCYGYNPNL VAAEDVPKSA PDFLDSKFRG
KIISTYPQDD DITLYLYWTI VDKYGWGYMS ELMKNQPRFI RGHLGIAQEI AAGNAALSFD
ASVTTITRAQ AAGGTIEAAF SQTDPMPIWD NRLCIFKDAP HPNAARLFQT WLLSREHQIA
AGLWSTRRDV EPEGGKFKGI DEYFTAHNFK DFILQPADEL QELRVRFESF IGPVEGMPVL
SAPAKN