Gene GM21_1424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1424 
Symbol 
ID8136752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1675610 
End bp1676770 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content51% 
IMG OID644869037 
Productextracellular repeat protein, HAF family 
Protein accessionYP_003021240 
Protein GI253700051 
COG category[S] Function unknown 
COG ID[COG5563] Predicted integral membrane proteins containing uncharacterized repeats 
TIGRFAM ID[TIGR02913] probable extracellular repeat, HAF family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0000000000000371296 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAGGAC TCGCTGCAAA ACTTGCAAGG TTAGGTACTT TTTCTTTTAC GCTGCTTTCC 
CTGATACTTT TGATCGAAAC TGAAAGTTGG GGGAGGGATG GCGCCAGTGT GCCCTCTGAA
CGAAGGATTC TCTCTGTATC AGTCCAGGAT CTTGGAACCT TCGGTGGATT ATACAGCTGG
GCTTCAGATA TCAATGACAA AGGGCAAGTG GTAGGAACAA GCCAAACGTC TACAGGCGCT
AGCCGCGGCT TCATCTGGCA GAACGGGGTG TTAACCGACT TGGGAACCCT GGGATATGCA
ACCACAGCTG GACACATCAA TAACAAAGGA CAAGTCGTAG GTGTTAGCAA AGCCTCTGCA
ACGGTTACCT CTGCTTTCAT CTGGCAGGAT GGTGTCATGA CCGACATCGG GAGTCTCGGA
GGAGGGGGCA CTTCGCCTGC AGATATAAAC GATAAAGGGC AGGTGATAGG GACAAGCAGA
ACCTCTCAAG GTGCCATGCA CGCATTCATT TGGCAGGAAG GAGTGATGAC CGATCTAGGA
ACTCCTGACG GTGTTTACTC AACGGCACAG GATATAAACG AGCATGGGCA GGTCATAGGC
CAGATTGCGT CACCAGGGTC GACCGGGCAT GGGTACATCT GGCACGACGG TATTATGACC
GATCTTGGAG AGGGGTTTTA CCCGGAACGT ATTAACGAGA AGGGACAGGT TATCATCAGG
GAGTTTGCTT CTTTTGGCAA CCCGCATGGC TTCCTCTGGC AAGACGGCGT GATGACCGAT
TTGGGAACCT TAGGTGGAAA CGAGTCTGAC GTCATCGATA TAAACGACAA GGGGCAGGTC
GTAGGCCATA GCATGACTAC TTCTGGAGAA ATGCACGTTT TCATCTGGCA TAACGGAGTG
ATGACCGACT TGGGAACGAC TCAAATCGGG GGTTTTTACC CGAGAGACAT CAACGATAAA
GGAGAGATCC TTGGGGTAAG GAGTCAAGCC TCGGGAATCG TTCAGCCCGT CCTTTGGCAA
AAGGGCACCA TAACCGAACT GGGAACGCTT GGCGGGGAGT GCAACGCCCA CGTTCTAAAT
AACCACGGGC AAGCAGTCGG AAGTAGCCAA ATTTATGCGA ATTCTTATGA GCGGCATCCC
GTTGTCTGGA CGATAAAATA A
 
Protein sequence
MKGLAAKLAR LGTFSFTLLS LILLIETESW GRDGASVPSE RRILSVSVQD LGTFGGLYSW 
ASDINDKGQV VGTSQTSTGA SRGFIWQNGV LTDLGTLGYA TTAGHINNKG QVVGVSKASA
TVTSAFIWQD GVMTDIGSLG GGGTSPADIN DKGQVIGTSR TSQGAMHAFI WQEGVMTDLG
TPDGVYSTAQ DINEHGQVIG QIASPGSTGH GYIWHDGIMT DLGEGFYPER INEKGQVIIR
EFASFGNPHG FLWQDGVMTD LGTLGGNESD VIDINDKGQV VGHSMTTSGE MHVFIWHNGV
MTDLGTTQIG GFYPRDINDK GEILGVRSQA SGIVQPVLWQ KGTITELGTL GGECNAHVLN
NHGQAVGSSQ IYANSYERHP VVWTIK