Gene GM21_1703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1703 
Symbol 
ID8137034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1983181 
End bp1985523 
Gene Length2343 bp 
Protein Length780 aa 
Translation table11 
GC content62% 
IMG OID644869315 
ProductPTSINtr with GAF domain, PtsP 
Protein accessionYP_003021515 
Protein GI253700326 
COG category[T] Signal transduction mechanisms 
COG ID[COG3605] Signal transduction protein containing GAF and PtsI domains 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value2.57578e-17 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCGGAGG AGACTGGGGA ACAACTGGGA CTGAGGACAC TGGAGGACAT CAGCATGCTG 
ATCCTTCACT CCCACGATCT TCAGGAGACC CTGGATAACA TAGTAAACCT CGTCGCTAAG
AGGATGTCCT CCGACGTCTG TTCCATCTAC CTTCTGGAGG AGGACGGCGA GACGCTGAGG
CTGCACGCGA CCCGCGGCCT CTCCAGGCTG TCCGTCGGCA TCACCATGAA GACCTCGGAA
GGTCTCACCG GCCTTGCCGT CGAGCAGCGC GGCGTAGTGG CGACGGACAA CGCGCCGCGG
CACCCTCGCT ACAAGTACTT CAGGCAGACC AAGGAGGAGA AGGTCCTCTC CTTCCTCGGC
GTCCCCTTCT TCGAACGCAA CAACCCCATC GGCGTCCTTG TCATACAAAA CCGCGACGCC
CGCACCTTCA CCTCCCAGGA AATCAGCGCC GTCTCGACCA TCGCCTGGCA GATCTCCAGC
ATCGTCTCCA ACGCGAAGCT CCTCGACTCC ATAAGGAAGA AGGAAGAGGA GCGAGCCTTT
TACGCAGCCG AGGTGGACCG GCTCCGGAAA ACGGGGGTGC TCAAGGACTC CGGCCGCGCG
AGCCGCAAGA GCCAGGGATC GGGCGTTTTG ACGGGAATCG GGATCTCCCC CGGTTTCGCG
ATGGGGAGAA TATCGGTACT GCACCGCGGT GCGAGCGAGG AAGCGGTCCT TGAACAGGCG
CGGCCGCGCG CCGAGGAACA GACCCGCTTC CTGCACGCGC TGGAAAAGGC GCGGATCCAG
ACCATCTACA TGGAAAAGCG GGTGGGCCAA ATCCTCTCCG AGGCCGACGC CGCCATCTTT
CACAGCCATC TGATGATACT GGAGGACCGC GGCTTCATCG GGAAGATCGG CGCGCTGATC
GACGAGGGAC TGGGAGCGCA TACGGCCGTA AGCCAGGTGG TGGAGAACTA CGTGGCGGCC
TTCGCCCGGA TGCAGGACCC TTACCTGCGC GAGCGCAGCG CCGACATGGA AGACATCGGG
CGCAGGATCT GCGACGCGCT GAACGGCAGC AACCACAAGC ACCGGGAGCG CCTGCGCGAC
CCCCGGATCA TCATCGCACG CGAACTGCTC CCCTCCGATC TCGCCATCAT GGACCACGGC
AAGGTGACCG GCATCGCCAC CGAAAAGGGA AACCAGAACG CGCACGCGGC GATCATGGCC
CGGGCGCTCG GCATTCCTGC CGTCTTCGGG GTAGAGGGGC TGTTGAAAAA GGTAGGGGCT
CGCTGCGAGG TGGTCGTTGA CGGCAACTCC GGCTGCGTCT ACATCAACCC GGACCAACGC
ATCAAGAAGG AATACCAGCG GCTACAGGGG GAGTTCGACC AGAAGCGGCG CGAGCTGGAA
GGAATCAGGG ACCTCCCCGC CGTGACCACC GACGGCTGCA CCGTGTCTCT TCTGGCCAAT
ATCGGTCTTT TGAGCGATTT GAGGGTAGCG CAGGCGCACG GCGCGGAGGG GGTCGGGCTG
TACCGGACCG AGTTTCCCTT CATGAGCCGT AACTCGTTCC CCGGACGTAC GGAGCAGGCC
GCCATCTACC GCAAGGTGCT GGAAGGGTTC CCCGGGCTCC CGGTCACCAT CAGGACCCTC
GACATAGGCG GAGACAAGGA GCTTTCCTAC TTCCCCCACC CAAAGGAAGA CAACCCCTTC
CTGGGGTGGC GCTCCATGCG CATCTCACTC GACCGCGAGG ACATCTTCCG GGAGCAGTTG
GCGGCGGTGC TTTTGGCGTC GGCCAGTGGC AGGTGCAACA TCATGTTCCC CATGATTTCC
GGCGTGGACG AGGTGCGCCG CATCAAGGGG ATACTGGAGC AGGTGAAAGA AGAGCTGAGA
AAAGAGGGGA AGGAATTCGC CCAGGATATC GGGCTCGGGG TCATGGTGGA GCTGCCGGCT
GCGGTGATGG TGGCGGAGAT GCTGGCCCGC GAGGTCGATT ATCTGAGCAT CGGGACCAAC
GACCTGATCC AGTACACACT TGCCTGCGAC AGGAACAATC CGAGGGTGAA GAAGTGGTAC
GACCCATACC ATCCTGCGGT CTTGCACTCG ATCAAGAAAG TGGCCCAAGC AGCCGCTAGC
GCGGGAAAAC CGGCGTCGTT ATGCGGCGAG ATGGCGGGAG AACCGGTCAA CGCGGTCCTT
TTGCTGGGGC TTGGGATGCG CTGCTTCAGC CTTTCCGCGC CGAACATACC GCGGGTGAAG
GAGGCCATAC GCGCCATTTC CCTGGGGCAG GCGGAACAGA TAGCCGGGCA GGTGCTGCGC
ATGGAAAGCG CCGTCTCTAT CAAGAGCTAC CTGGAATCGG TACAGCGCGA GCTGGGGCTA
TAG
 
Protein sequence
MPEETGEQLG LRTLEDISML ILHSHDLQET LDNIVNLVAK RMSSDVCSIY LLEEDGETLR 
LHATRGLSRL SVGITMKTSE GLTGLAVEQR GVVATDNAPR HPRYKYFRQT KEEKVLSFLG
VPFFERNNPI GVLVIQNRDA RTFTSQEISA VSTIAWQISS IVSNAKLLDS IRKKEEERAF
YAAEVDRLRK TGVLKDSGRA SRKSQGSGVL TGIGISPGFA MGRISVLHRG ASEEAVLEQA
RPRAEEQTRF LHALEKARIQ TIYMEKRVGQ ILSEADAAIF HSHLMILEDR GFIGKIGALI
DEGLGAHTAV SQVVENYVAA FARMQDPYLR ERSADMEDIG RRICDALNGS NHKHRERLRD
PRIIIARELL PSDLAIMDHG KVTGIATEKG NQNAHAAIMA RALGIPAVFG VEGLLKKVGA
RCEVVVDGNS GCVYINPDQR IKKEYQRLQG EFDQKRRELE GIRDLPAVTT DGCTVSLLAN
IGLLSDLRVA QAHGAEGVGL YRTEFPFMSR NSFPGRTEQA AIYRKVLEGF PGLPVTIRTL
DIGGDKELSY FPHPKEDNPF LGWRSMRISL DREDIFREQL AAVLLASASG RCNIMFPMIS
GVDEVRRIKG ILEQVKEELR KEGKEFAQDI GLGVMVELPA AVMVAEMLAR EVDYLSIGTN
DLIQYTLACD RNNPRVKKWY DPYHPAVLHS IKKVAQAAAS AGKPASLCGE MAGEPVNAVL
LLGLGMRCFS LSAPNIPRVK EAIRAISLGQ AEQIAGQVLR MESAVSIKSY LESVQRELGL