Gene GM21_2572 details


Gene Information

Locus tag: GM21_2572
Symbol:
ID: 8137914
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3003761
End bp: 3005020
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 63%
IMG OID: 644870180
Product: hypothetical protein
Protein accession: YP_003022370
Protein GI: 253701181
COG category:
COG ID:
TIGRFAM ID:
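
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive positions and that Gene Length includes the stop codon. The record states neither convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields.
length_bp = gene_length(3003761, 3005020)
print(length_bp, protein_length(length_bp))  # prints: 1260 419
```

Under these assumptions the computed values agree with the reported Gene Length (1260 bp) and Protein Length (419 aa).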


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 75
Fosmid unclonability p-value: 0.718881
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCAAGTG TTGAAAATAG GCAGGTAGCT GCGGCGAGGC TCTCTCAGGG CGATTTCCAG 
GCGCTGCCGC TCGACAAGAA GGGGAAGTAC CTGAAGACCG CGCCGGCACG GGAGAAGATG
GAGCTGATCA TCGCCGACCC GGACGCGAAG AGGGTGGCCG CCACCCTGGA GCCCCAGGAA
TTTTTCTGGC TGGTCAAGGA GGTGGGCGAG ACCGACGCGC TGGAACTGCT GCAGGTGGCC
TCCGCCGACC AGTGCGTTTT CATCCTGGAC ATGGAGGCGT GGGAAGGGTG GACCTTTTCC
GAGGAGCAGG TGCATCACTG GCTGGAGAAG TTCATGGAGG GGGGCGAGCC GCGCGTTCAC
GAGCTCTTGA AGCACCTCGA TTTCGACCTG CTGCAGCTCT TCCTGAGCCG TGAGATCGAG
GTCGGCGGAG GGATCGGCGA CCAGTCCAAC GACCAGGAAC GCCTCGCCGA GTACGATCAC
ACCTTCGATG GCGTCTTCAT GATCAACTTC AAGAACCCCA AGCACAGCCA ATTGGTCGGT
ACCTTCCTGT CCATGCTGAT CAAGCTGGAC AATTCCCTCT ACACGGCACT CATGGAAGGG
GCCAAGGGGG AGGTGGACCT GGAGTTGGAG GAGCAGTGCC AGAGGTTCCG CACCGGCAGG
CTTCAGGACC TGGGTTTCCC CCCGCTGGAC GAGGCGCTTT CGATCTACGC CCGGGTAAAC
CCGGAGCATT TCCACCTGGA AGGGGGGAAG GAGTTGAGCC CGGCAGGGGA GGGGGGGCAA
CTGGTACCCG TGGGCGCCGA CGAAGGGACC TTCTTTTCCC GCGCGCTCGC CCTCGCCGCG
ACGCCGACGC TCTACCAGGA GCTGAACTAC CTGGTCAACA GCGCCCTGGT CGCGGAAGGA
AACGCGTTCC ACGAGCCGGA AACGATGCTG GCCATTCTGC ACCGGGTGAG CGGCTATCTC
AACATCGCGC TGGAGAGGCT GGCGCCGGCG GACGAGCAGC GGGCCGCGGA CATACTGGTA
AGCGAGGAGT TGAAGAGGCT GTTCCAACTG GGGTACAGCA TCGTCTTGCA GTTGAAATTC
AGCGCCCGCG ACGTCGAGAC GGCGGACTAC GCTTCCGGGA AGCTGCTGGC GGGGCTTAAG
ACCAAACGCC CCCGGTTCTA CCGCGGGCTG GACCCGGACG GCGTCGACGG CTACCGCGAG
TTCAGGGACC TTTCCGACGT CCAGCGCGTG GCGGACCTTT TGGCCCAGCT AAAACCCTGA
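
The GC content field can be recomputed directly from the gene sequence. The sketch below is one way to do that in Python. It assumes the sequence is pasted as shown here, in space-separated blocks of ten bases, and `gc_content` is a hypothetical helper name rather than anything provided by the source database.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence only (60 of the 1260 bases).
first_line = "TTGGCAAGTG TTGAAAATAG GCAGGTAGCT GCGGCGAGGC TCTCTCAGGG CGATTTCCAG"
print(f"{gc_content(first_line):.1f}%")  # prints: 55.0%
```

Passing the full 1260-base sequence instead of the first line gives a value that can be compared with the 63% reported in the record.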
 
Protein sequence
MASVENRQVA AARLSQGDFQ ALPLDKKGKY LKTAPAREKM ELIIADPDAK RVAATLEPQE 
FFWLVKEVGE TDALELLQVA SADQCVFILD MEAWEGWTFS EEQVHHWLEK FMEGGEPRVH
ELLKHLDFDL LQLFLSREIE VGGGIGDQSN DQERLAEYDH TFDGVFMINF KNPKHSQLVG
TFLSMLIKLD NSLYTALMEG AKGEVDLELE EQCQRFRTGR LQDLGFPPLD EALSIYARVN
PEHFHLEGGK ELSPAGEGGQ LVPVGADEGT FFSRALALAA TPTLYQELNY LVNSALVAEG
NAFHEPETML AILHRVSGYL NIALERLAPA DEQRAADILV SEELKRLFQL GYSIVLQLKF
SARDVETADY ASGKLLAGLK TKRPRFYRGL DPDGVDGYRE FRDLSDVQRV ADLLAQLKP