Gene GM21_2449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2449 
Symbol 
ID8137790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2856783 
End bp2857994 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content65% 
IMG OID644870059 
ProductVanZ family protein 
Protein accessionYP_003022250 
Protein GI253701061 
COG category[S] Function unknown 
COG ID[COG5652] Predicted integral membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones168 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAACGAA AATCGCTCTT TCCGGCCTTG GCCTGCGTCC TATTGATCGC CTACGCGTCT 
CTCTTCCCCC TAACCGGCTG GCGCCTACCC GGCGCCGGCT TTTTCGCCTG GTGCACCATT
GAGCTTCCCG GCCGCGTCTC CAAGAGCGAC CTCCTCGTCA ACGTCATCGC CTACGTCCCT
TTGGGCTACC TCCTTTTCCG CCTGTTCCGC CGGGATGACG GGCGCATCGC CGTTGCCTTT
CTCTGCGCGC TCGCCGCCGG AAGCGCGCTC AGCTTCGCCA TGGAATTTAT CCAAGCCTTC
CTCCCAAGCC GCACCCCCTC CGTCGTAGAC CTCTGCACCA ACACCCTGGG AACCTGCGCC
GGCGCGCTCC TCGCGCTTTG CCGGCAGCAA GCGGCCGTGC CCGAAGGGGC CTGGTCCCGC
TGGCGGGCGG GCTTTCTCAC AGCCGGTAGC CGGGGGGAGC TTGGCCTTTG CGTCCTTTTG
CTCTGGCTTT GCTCCCAATG GGCACCCTTC GTCCCCTCGC TGGACTTAGG CGGCGTAAAG
AACGGGCTCA AGCCCCTTTG GCAAACGGCG CGCGGACTGT CGCGCTTCGA CCTGGCGCAG
GCCGCCACCT ACTTTTTCTA TTTAGCCGGG CTCGGGGTGG TGGCGCAGGA GACCTTCCGG
CGGCGCGCCC TTGCCCTCCC CCTTTTCTCG TTTTTTGCGG CGGGCGTTCT CTGCGCCAAG
ATCTTCATCC AGGGGCGACA GCTCTCCCTG GAGGCGTTAG CGGGGTTGTT AGCTGCTGTT
CCGCTTCTGG TCGCGGCGGG GCTTATGGGG GAGAAGTCGA GAAGGGTTTT TTCGGGGTGC
CTGCTGCTTA TCGCCGGTTT CGCCTTCTAC GAGTTGAAGC CGGGTGTGGG GAGTGTAGCG
GGAGGGTTCA GTTGGATACC GCTGCAGGGG CAGTTGGCAC ACGAGCTGAG CGGCTTCGGG
ACCATACTCG AAGGAGTGTG GCCTTTCGCG GCCATGGCGC TCCTGGTCGC GCCGGGGCGG
GAAGAAGCGA GGGGTTCGTC CGCGCCGGGA GCGGCAGCCG TCTTTTGTTT CGTCTTCGCG
CTGGAGTGGG TGCAGCTAGC GATCCCCGGC CGCACCCCCG ATCTGACCCA GGCGCTGTTG
GCGCTCGCCG GTTGGCTCGC ACCGGCCTTC TATCTGCGAC AGGCGGAGCT GCGAGGTTGG
ATTCGGACCT GA
 
Protein sequence
MKRKSLFPAL ACVLLIAYAS LFPLTGWRLP GAGFFAWCTI ELPGRVSKSD LLVNVIAYVP 
LGYLLFRLFR RDDGRIAVAF LCALAAGSAL SFAMEFIQAF LPSRTPSVVD LCTNTLGTCA
GALLALCRQQ AAVPEGAWSR WRAGFLTAGS RGELGLCVLL LWLCSQWAPF VPSLDLGGVK
NGLKPLWQTA RGLSRFDLAQ AATYFFYLAG LGVVAQETFR RRALALPLFS FFAAGVLCAK
IFIQGRQLSL EALAGLLAAV PLLVAAGLMG EKSRRVFSGC LLLIAGFAFY ELKPGVGSVA
GGFSWIPLQG QLAHELSGFG TILEGVWPFA AMALLVAPGR EEARGSSAPG AAAVFCFVFA
LEWVQLAIPG RTPDLTQALL ALAGWLAPAF YLRQAELRGW IRT