Gene GM21_1192 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1192 
Symbol 
ID8136517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1384300 
End bp1385517 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content61% 
IMG OID644868806 
ProductNHL repeat containing protein 
Protein accessionYP_003021011 
Protein GI253699822 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones99 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGATTT CTTTGCAGAA AAAGGTAGGG TGGGCCGCAC TGCTTTTGGC CATGTTGGCT 
TCGTTTTCGC CCACGACAAC CGCGATGGCC GGGACGGCAC CGGTGACGAC AGTGTTGACA
CCGCCCATGT ACGAAAACTT GAGCACTCCG GTCTCCATCG CGATGGACCC GCGCGGCTTT
TACTACGTGG CGGACCCTCG CAAGGGTACC GTCAACAAGT TCGATCTCTC GGGGAAGTTG
CTCCAGACAT TCCGGCTCGC CTCCGCCCCT CGTGCGGTTG CCCTTGATAA CAGCGGCAAC
CTGCTGGTGA GCCTGGGGAC CTCGGTAGCA CTGGTCGATC AGCAGGGTGC GCAGCAGCTG
CTCCTCGGCT CGGGGCAGGG GCAGTTCCAA TACGCCGTCG GCATTGCGGT CGATGCCCGG
GGCTACATCT GGGTCGCCGA CAATCAGGCG CACAACGTAA CCCTTTTCAC CTCGGCCGGA
ACAGTGGTGA AGACCATCGG GGGGCTGGGT AACGGCACCG GGCAATTCGA CTTCCCGGTG
GGGGTCTCCT ACGAGAAGGT TGCCGACCAG GTGGTGGTGG CCGATGCCGG CAACCACCGG
CTGCAATTTT TCGACGCTGC CGGCAACAAC GCCTTCGTCA AGTCCATCGG GAGTTTCGGC
TCCGGCCCGC TCCAATTCCA ATATCCGGTA GGCTCCGGGT TCGAGTACAG CCAAGCCGGC
CAGTTGAACC GGATGTACGT GGCGGACCAG CATCTTAGTA CGGTGCAGGT GCTCGACCCG
GCCGGAATCG GCACCTTCCT GAAGTTTATC GGGTACAGCG GGCTGGTGAG CGGCACTGTC
ACAAATCCCA TGGCAATTGG CTTCGATGCG ACCAACAAGC GCCTCATCGT CGTGAACGGG
AACGGGCGTC TGCACCTTTT CGGCATCGAC GGAGGGGTGA ACCCGGGACT GCCGCAGGGG
CTCGTGATCG ATCCGGTACT TACCGAGGTC AAGGGGCCCA GCTTGACCAT CACTGGAACC
TTTCCCCCCA ACACCACCGT GGAGATGCAG ATCAACGGCA TCGCCGCGTC CGAACCGGTA
TCCTACACCT CAGCCTCCAC CTGGAGCGCC ACCTTCCAGA ATCTCGTTCC CGGCAGCAAC
GTGCTGACGG CGATTGCCCG GGATGCCCGC GGTTACATGA TCACGAAACA AGCCATCGCC
ATCCTCTATT CCCCGTAG
 
Protein sequence
MLISLQKKVG WAALLLAMLA SFSPTTTAMA GTAPVTTVLT PPMYENLSTP VSIAMDPRGF 
YYVADPRKGT VNKFDLSGKL LQTFRLASAP RAVALDNSGN LLVSLGTSVA LVDQQGAQQL
LLGSGQGQFQ YAVGIAVDAR GYIWVADNQA HNVTLFTSAG TVVKTIGGLG NGTGQFDFPV
GVSYEKVADQ VVVADAGNHR LQFFDAAGNN AFVKSIGSFG SGPLQFQYPV GSGFEYSQAG
QLNRMYVADQ HLSTVQVLDP AGIGTFLKFI GYSGLVSGTV TNPMAIGFDA TNKRLIVVNG
NGRLHLFGID GGVNPGLPQG LVIDPVLTEV KGPSLTITGT FPPNTTVEMQ INGIAASEPV
SYTSASTWSA TFQNLVPGSN VLTAIARDAR GYMITKQAIA ILYSP