Gene GM21_3109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3109 
Symbol 
ID8138459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3605163 
End bp3606230 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content65% 
IMG OID644870713 
ProductNHL repeat containing protein 
Protein accessionYP_003022895 
Protein GI253701706 
COG category[S] Function unknown 
COG ID[COG3391] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones150 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCGCT GCTTCCTTTT CCGCTCGCGC TACCTCGTCC ATGCGGCGGC GTTTTTGGCG 
GCGTTGCTCC TTCTGGCCGC GCCGGGTTGC GCCGTCAACG AGGCGGCCCG GATCCCGCAG
CCCGTCGACA AGGTGGTCTG GCCCCCCCCG CCCCTCGAAC CCCGCGTGGC CTGGGTGCAG
CTGATCCGGA ACTCGAACGA CTCTGGCATC GAAAAAGGAT TTTTCTCAAG GGTATCCGAC
CTTTTGTTCG GCGAGGAGGT CCTGCGGGTG AGCCGCCCTT ACGGCATCCA CGTGGACAAG
AAAAAGAGGG TCATCTTCGT CAACACCGGC ACGGGGAGCG TCCATGTCAT CGACCGCGGC
GCCGGTCGTT ACGGCGTGGT GACCGGCCCC GAGGGTGAGC CATTCCTCTC CCCGATAGCG
GTGACCGAGG ACCCCGACGA GACCGTCTAC GTGACCGACT CCGCGGCGGC GAAGGTTTAC
CGTTTCAACG CCTCGGACCT GAAGGTGGAG CCCTTCATAA CCACCGGTTT GCAAAGGCCT
ACCGGCATCG CCTACAACCC GGCGACCGAT CTGATCTACG TCACCGACAC CGTGGCCGGG
CAGGTCGTCG CCTTCACCAG AAAGGGGAAG GAGGCGTTCC GGTTCGGCTC CCCCGGCAGC
AAGCCGGGCC AGTTCAACCA CCCGACGGAC ATAGCCGTGG ACGCCAAGGG GGGGATCGCG
GTCACCGATC CTTTGAACGG CCGGATCCAG ATCTTCTCCG GCAAGGGGGC GTTCCTCGCC
GCCTTCGGCC GGATGGGGAA CACCTCGGGA AGCTTCGCCA AGCCCAAGGG GGTGGCGGTC
GACAGCAGCG GTAACCTGCA CGTCTGCGAC GCCCTGTTCG ACACGGTCCA GGTGTTCAAC
CCGCGGGGGG AGCTCCTGCT CAATTACGGG ATCAGGGGGG GGGAGAGGGG GGAATTCTGG
ATGCCCTCCG GCCTCTACAT AGACGGCGAA GACGCCATCT ACGTGGCGGA CACCTACAAC
GACAGGATCC AGGTGTTCCA GTACCTGAGG GACGTGACCG AGAACTAG
 
Protein sequence
MDRCFLFRSR YLVHAAAFLA ALLLLAAPGC AVNEAARIPQ PVDKVVWPPP PLEPRVAWVQ 
LIRNSNDSGI EKGFFSRVSD LLFGEEVLRV SRPYGIHVDK KKRVIFVNTG TGSVHVIDRG
AGRYGVVTGP EGEPFLSPIA VTEDPDETVY VTDSAAAKVY RFNASDLKVE PFITTGLQRP
TGIAYNPATD LIYVTDTVAG QVVAFTRKGK EAFRFGSPGS KPGQFNHPTD IAVDAKGGIA
VTDPLNGRIQ IFSGKGAFLA AFGRMGNTSG SFAKPKGVAV DSSGNLHVCD ALFDTVQVFN
PRGELLLNYG IRGGERGEFW MPSGLYIDGE DAIYVADTYN DRIQVFQYLR DVTEN