Gene GM21_1171 details


Gene Information

Locus tag: GM21_1171
Symbol:
ID: 8136493
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1363066
End bp: 1364331
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 61%
IMG OID: 644868782
Product: protein of unknown function DUF445
Protein accession: YP_003020990
Protein GI: 253699801
COG category: [S] Function unknown
COG ID: [COG2733] Predicted membrane protein
TIGRFAM ID:
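
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1363066, 1364331)
print(length, protein_length_aa(length))  # 1266 421
```

Both values agree with the Gene Length and Protein Length fields of the record.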


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 1.23014e-26
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTGGATG AGAGAAGAGC TGCACTGAAA AAGAACAAGA TGATCGCCCT AGCCCTCATG 
GCGGGCGCCG CCGCCCTATT CGTCGTGGCC CGTCTGCAAC GGGGAAGCAG CGGCTGGGAA
TGGGTGGCCG CGTTCGCCGA GGCCGCCATG GTCGGCGCCC TTGCCGACTG GTTCGCCGTG
GTGGCGCTGT TCAGGCACCC GCTGGGACTT CCCATACCCC ACACCGCCAT CATCGCCGAC
AAGAGGGAAA CCATCGCCGA CAACCTGGCC CGCTTCATCC AGGAGAAGTT CCTGACCACA
GAGGTCCTGG TGGGAAAGAT GAGGGAATTC GACCCCGCGC GGCAGCTTTG CCGTTATCTC
ACCTCGCGGG ATAACGCGCA GGGGCTGGCA AGGGGGGTGG CTCGCATCCT CTCCGAATCG
ATCGGTTTCC TGGAGGACGA GCGGGTCAGC AGGATAATCA TGGCCGCAAT GCACGACCGG
ATCGGCAAAT TCGACATGGC CGGCTCCGCG GCGAGCCTGC TTGAGTCCCT GAGAAAGGAC
GATCGCCATC AGGCCGTTCT GGACGAGATC CTGCGGAGGC TGGGCGCCTG GCTTGGGACC
CCCGAGTCGC AGGAGAAGAT CGCCATGGCG CTGGACAACT GGGTCGACAC CGAATACCCG
CTCCTCAGCA AGTTCATCCC GAACCGTCCG CAGTTCTCCA GGAACGCCGG CGAAAAGATC
ATAGGTAAGG TGAGCGGCTT CCTGCACCTG GTCAACGCCG ACCCGGACCA CGAGCTGCGC
CAGGAGTTCG ACCGGGCAGT GGGCGACTTC ATCGTGAAGC TGAAACACGA CTCCGGCACC
AGGGAGAAGG TCGCCGAGCT GAAGCGGGAG GTGATAGATA ACGAGCAGCT CTCGCTGTAC
GCCAAGAGCC TCGTTGCCGA CCTCAAGCAA TGGATGGTGG AGGACCTGGA CCGGCCATCA
TCAAGCATCC GCGCCAAGAT AGCCGACGCC GCCGTTGCCC TTGGCAACAC CCTGTCCGAG
AACTCCGATT TGGCCGATTC AGTGAACGAG CACCTGGAGC GCGTGGTGAG AAAGTACGCC
GACAACCTGA GAGCCGGGTT CTCCCGTCAT GTAGCAGGTA CCGTAAAAGA GTGGAAGGAA
GAGGAGTTTA TCGAGGAGAT CGAGCTCAGC ATAGGGAGCG ATCTGCAGTT CATCAGGATG
AACGGCACGC TCGTCGGCGG CATGATAGGC CTATTGTTGC ACGCGGTGTC GCTGCTTCTA
GGATAA
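
The GC content field can be recomputed from the gene sequence. The sketch below is one way to do that under the assumption that the sequence is pasted as shown here, in space-separated blocks of ten bases across several lines. The helper names `clean_sequence` and `gc_percent` are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Small illustration; paste the full gene sequence in place of this string
# to compare the result with the GC content field of the record.
print(gc_percent("GGCC AATT"))  # 50.0
```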
 
Protein sequence
MLDERRAALK KNKMIALALM AGAAALFVVA RLQRGSSGWE WVAAFAEAAM VGALADWFAV 
VALFRHPLGL PIPHTAIIAD KRETIADNLA RFIQEKFLTT EVLVGKMREF DPARQLCRYL
TSRDNAQGLA RGVARILSES IGFLEDERVS RIIMAAMHDR IGKFDMAGSA ASLLESLRKD
DRHQAVLDEI LRRLGAWLGT PESQEKIAMA LDNWVDTEYP LLSKFIPNRP QFSRNAGEKI
IGKVSGFLHL VNADPDHELR QEFDRAVGDF IVKLKHDSGT REKVAELKRE VIDNEQLSLY
AKSLVADLKQ WMVEDLDRPS SSIRAKIADA AVALGNTLSE NSDLADSVNE HLERVVRKYA
DNLRAGFSRH VAGTVKEWKE EEFIEEIELS IGSDLQFIRM NGTLVGGMIG LLLHAVSLLL
G
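
The protein sequence can be derived from the gene sequence by codon translation. The following is a minimal sketch, assuming that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which codons may act as start codons; since this gene starts with ATG, that difference does not matter here. The function name `translate` and the table construction are illustrative, not taken from the source database.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGTTGGATG AGAGAAGAGC TGCACTGAAA"))  # MLDERRAALK
```

The output matches the first ten residues of the protein sequence listed in the record.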