Gene GM21_3731 details

Gene Information

Locus tag: GM21_3731
Symbol:
ID: 8139105
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4298566
End bp: 4299723
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 68%
IMG OID: 644871350
Product: hypothetical protein
Protein accession: YP_003023508
Protein GI: 253702319
COG category: [S] Function unknown
COG ID: [COG1641] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00299] conserved hypothetical protein TIGR00299
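
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4298566
end_bp = 4299723

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is the stop codon and
# contributes no amino acid to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1158
print(protein_length)   # 385
```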


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 116
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGTGG CTTATTTCGA CTGTTTCGCG GGAATCGCCG GCGACATGAC CGTCGCGGCG 
CTGATCGAAC TGGGTCTCCC GCTCGAGGTT TTGCGGCGGG AACTGGCGGG GCTGCCGCTT
TCGGGCTACG CGCTGGAAAG CTCGAAGGTG GAGCGGCACG GCGTGGCCGG GACCTCCTTC
AAGGTGACCC TCACCGAGGC GGACCAGCCG CACCGGCACT ACAGCGGCAT CGCGAAGATG
ATCGACGAGT CCGGCCTGAA ACCGAGGGTG AAGGAGCTCG CGCAGCGGAT CTTCAGAAGG
CTCGCCGAGG CGGAGGCGGC CGTGCACGGC GTCCCCCTGG AGCGGGTGCA CTTCCACGAG
GTGGGCGCGA TCGATTCCAT CGTCGACATC GTGGGGACCG CCATAGGGCT CGACTACCTG
GGGGTGGAAG CGGTCTACGC CTCCGGGCTT CCCTACGGCA GGGGGTTCGT GCAGACGGCC
CACGGCAGGC TCCCGGTCCC GGCACCGGCG ACCGCCAAGC TGATGGAGGG GATCCCCTTG
ACCTTCGACA TCGGCGAGGG GGAGCGGGTG ACGCCGACCG GAGCGGCCAT CATCGCGGCG
CTGGCCGAGG GGTTCGGCCC GCCGCCTTCC CTGACGCCGC TTGGAACCGG CTACGGCGCA
GGGGAAAAGG ACTTTCCGGA GCTCCCGAAC CTGCTCCGGG TGCTCCTGGG GGAAAGGGCG
GAGGGAAAAG GGCACCAGGA GGTGCTGGTC CTCGAGACCC ACATCGACGA CATGAACCCC
GAGATCTTCG GCTTTCTCAT GGAGAGGCTC CTGGAGGCGG GGGCGCTCGA CGTCGCCTTT
TCGCCCCTGC AGATGAAGAA GAACCGCCCC GCCACCCGCC TGACCGTGAT CGCGGACCCC
GCCGACCTGG AAAAGCTCTC GGCCATCGTG CTCTCGGAAT CGACCGCCAT CGGCCTGCGC
TACTACCCCG CCCGCCGCGT CACCGCCGCG CGCCGCTGCG AAACCCGGGA AACCACCCTG
GGCGAGGTCG CGGTGAAGGT GCTGGAAACT GGACGCGTGA CGCCGGAGTA CGACTCCTGC
CGCAAGATCG CCCTGGAGAA GGGCATCCCG CTCATCGAGG TGTACCGCAC CGTGGAAAGG
GAGTGCGGTC AGGCATGA
 
Protein sequence
MKVAYFDCFA GIAGDMTVAA LIELGLPLEV LRRELAGLPL SGYALESSKV ERHGVAGTSF 
KVTLTEADQP HRHYSGIAKM IDESGLKPRV KELAQRIFRR LAEAEAAVHG VPLERVHFHE
VGAIDSIVDI VGTAIGLDYL GVEAVYASGL PYGRGFVQTA HGRLPVPAPA TAKLMEGIPL
TFDIGEGERV TPTGAAIIAA LAEGFGPPPS LTPLGTGYGA GEKDFPELPN LLRVLLGERA
EGKGHQEVLV LETHIDDMNP EIFGFLMERL LEAGALDVAF SPLQMKKNRP ATRLTVIADP
ADLEKLSAIV LSESTAIGLR YYPARRVTAA RRCETRETTL GEVAVKVLET GRVTPEYDSC
RKIALEKGIP LIEVYRTVER ECGQA
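
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction measured, and its translation compared against the protein sequence. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is the standard one; translation table 11 assigns the same amino acids and differs only in the start codons it permits, which this sketch does not model.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
# Table 11 (bacterial) uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in a sequence (0.0 to 1.0)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, as displayed on the page.
first_line = "ATGAAAGTGG CTTATTTCGA CTGTTTCGCG GGAATCGCCG GCGACATGAC CGTCGCGGCG"
last_codons = "GAGTGCGGTC AGGCATGA"

print(translate(first_line))   # MKVAYFDCFAGIAGDMTVAA
print(translate(last_codons))  # ECGQA
```

Passing the full gene sequence to `gc_content` should give a value close to the 68% reported in the Gene Information table, and `translate` on the full sequence should reproduce the 385-residue protein sequence.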