Gene GM21_2672 details


Gene Information

Locus tag: GM21_2672
Symbol:
ID: 8138014
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3108328
End bp: 3109530
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 59%
IMG OID: 644870276
Product: hypothetical protein
Protein accession: YP_003022466
Protein GI: 253701277
COG category:
COG ID:
TIGRFAM ID: [TIGR02532] prepilin-type N-terminal cleavage/methylation domain
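
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 3108328
end_bp = 3109530

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A coding sequence should contain a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the last codon is a stop codon and encodes no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1203 400
```

Both results agree with the Gene Length (1203 bp) and Protein Length (400 aa) fields. The calculation does not depend on the strand, which the record leaves blank.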


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.0000390899
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGTAAGA GTCGCAATTC AGCGCACGGC TACACCATGG TCGAGCTGGT GGTGGTGATG 
CTGATCTTCT CGGTGGTGAT GACCCTGATC AGTGTCTCGT TCTCCCGAAT GGTTCGCGGT
TCAGGGCAGC TGCTTAAAAG CGCCGAAACC GACATCGGAG GACTCATCGG GCTGGAACTG
ATGCGCAGCG ACATGGAGTC CGCCGGATTC GGGCTGTACT GGCGCGGTCC TGGTGGCGGC
GCCTCCGCCG TCAGCTACAT GGAAGCCCCG GACGACCTGG TGCTGGTGAA AGACTGCCCA
GACGCGAAAC CCTCCCTTTT CGACGACCGT ACCTACGACA CAGACCGATA CATCCCGCGG
GCCTATCGGG TCGGCAACAA TGTAGGCTAC AACGGCTCCG ACTACTTAGT GATCAAAGGG
ACCACATTGG GGACGAACAA GGTGTCCCGG GCTTGGGGCT ACCTTAACTA CAGCACGGGG
AGGGTGGTTG TCCCCCCACG AGACGCAGTG AGTGAGCCGT TCAAAACAAG TGATCGTACC
ATCGTCCTGA AAAGCGGGAT CTCAGCAGGA AGGGAGGTTC GGGAGCTGGT GCTGGACCAT
ACTGAGTTCT CCGTATCCTA CGCCTATTCT TTCCCGGAGG CCTTCAGTCC CAAAGAACCT
GGCGACAATT TTCTCGTCTA CGGAGTGGAT AAGGCAGATG ATTCGGGGGC GAAGCTCAGC
TTTCCGTTCA ACAGGGCCGA CTACTACATC AACCGTCCCA GCGACATATC CCGAACCTGC
GCTCCCGGGA CTGGAATCCT GTACAAGACG GTAATCACCC AGAGTGGCGA CCCGGCCTAT
TATCCCATCC TCGATTGCGT AGCCGACATG CAGGTCGTGC TCCCCGTGGA CTCCAACGGG
GATGGTGCGA TCGATTACCA CCTCGACGCC GAGGAATTGG ACGCGCTGGA GACACGTGAA
CAGCTGAAGG AGATCCGGGT CTACATCCTG GCGCAGCAGG GAAGGAGGGA CGCCTCCTAT
CTCCACCCTG TGGCGGATCC AGAGAGGGCT TTTCTTGTTG GCGACCCGGA ACTTATCAAG
AAGCCGGAGA GCGCACTTGG GCGCATCTGG AGTTCGAGCA AAATGGGGGA CACCTTCGGC
GGCCACTGGC GCAACTACCG CTGGAGGCTT TACACCATCG TGGTGCAGCC CAAAAACCTG
TAG
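
The record reports a GC content of 59% for this gene. As a rough sketch of how such a figure can be derived from a sequence printed in 10-base groups like the one above, the hypothetical helper `gc_percent` below strips whitespace and counts G and C bases. How the database itself rounds or computes the value is not stated in the record, so this is an assumption.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and newlines between 10-base groups) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Small illustrative inputs, not taken from the record.
print(gc_percent("ATGC"))             # 50.0
print(round(gc_percent("GGCC AT"), 2))  # 66.67
```

Passing the full gene sequence above as a single string (line breaks included) to `gc_percent` gives a value that can be compared with the reported 59%.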
 
Protein sequence
MSKSRNSAHG YTMVELVVVM LIFSVVMTLI SVSFSRMVRG SGQLLKSAET DIGGLIGLEL 
MRSDMESAGF GLYWRGPGGG ASAVSYMEAP DDLVLVKDCP DAKPSLFDDR TYDTDRYIPR
AYRVGNNVGY NGSDYLVIKG TTLGTNKVSR AWGYLNYSTG RVVVPPRDAV SEPFKTSDRT
IVLKSGISAG REVRELVLDH TEFSVSYAYS FPEAFSPKEP GDNFLVYGVD KADDSGAKLS
FPFNRADYYI NRPSDISRTC APGTGILYKT VITQSGDPAY YPILDCVADM QVVLPVDSNG
DGAIDYHLDA EELDALETRE QLKEIRVYIL AQQGRRDASY LHPVADPERA FLVGDPELIK
KPESALGRIW SSSKMGDTFG GHWRNYRWRL YTIVVQPKNL