Gene GM21_1521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1521 
Symbol 
ID8136850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1777163 
End bp1778470 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content67% 
IMG OID644869133 
Productconserved repeat domain protein 
Protein accessionYP_003021335 
Protein GI253700146 
COG category 
COG ID 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.00000177441 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACATTAG TCCATCTTAA TAAATACGGC GCAAGGGTTA AACGCCCGTC CTTCCTGCTG 
GCGCAGCTTT GCTGCGCATT GCTTTTTCTG CTTCTCATGG GAAAAGAGGC CTTTGCGGCA
TACCAGGCCG ACCTCATGGT GAGGCTTGCC AACGAGGGCG ATTCCTCCTA CGCGGGCGCC
GGGATATTCG AGACCACAGC GGTGATCCAG TCCAAATCCC AGGGTTCCTA TTCCGGGTAC
CCGGCGCAGT TCCGGGTCCA GGTGAAGAAC GCGGGCGACC AGACGGACAG CTTCGTCCTC
ACCGGTCCCG CCGCGGGGAG CGGCTTCACG GTGAGCTACC GGGACCAGGG AGGTGTGGAG
CGTGCGGCCC AGTTCGCCTC AGGAGGGTAC CGGACCCAAT CCCTTGCCCC AGGCGCCTCC
GTCGTGCTGC TGGTGCAGGT GACGCTGAGC CGGTTCACCC CGGGGGCGAG CTACCGCGTC
CCCGTCACCG CGGTATCAGC GGGCGACCCT GCCGGGGCGG ACCAGGTGAA GACGGAGACC
GTCGCCTGTG GCCTCGCCGC CGCTGTCACC GTTTCGGCGC CCCCCGACGG CTCCGGTGCG
CCAGGCTCCC TGGTGCTCTA TCCCTACACC GTCACCAACG TCGGCAACGC CGTGAACAGC
TTCGCTCTTT CCTTGGAGGG GGGCGCCCCT TGGCCGGGGA TCCTTTACGC GGACGACGGG
GCGGGGGGAG GAATCGCAGG TGACGGGGTT AGGCAGCCCG GGGAAGAAAA CCGCTGCGTC
TCCACCGGCC CGCTCCCCCC CGGCGCATCC CATCGCTTCT TCCTTGCCGT CGCCATACCC
GAGTCGGGGA GCGACGGCGC ACGGGCGGAC GCTCGCCTGA CTGTCACAGG GGAGGGGGCG
AGCGGTAACG ATCAGGTCAC CACCACCGCC CTGGCCGCGG TCCTCTCGCT CGTCGACGGC
GTGCGCAACC TGACCAAGGG GGGGATCTTC GCCTCGGCCG TCGATGCGGT CCCAGGCGAC
CTGCTCCAGT ACCGGATGGC GATCACCAAC AGCGGTTCGG CCCCGGCCAA AGCGGTGCGG
GTCGAGAGCC CGCTGCCGGC CGGGTTGAAA CTGACGCCCG ATTCCATGGT GGTGACTTTA
GCCGCCGATG GTGAGGGGGC GCCTTGCCCG GCGGCTCAAT GCGGCCGTGC CTGGGGAGGC
GAGGGGAACA TCGTCGCCCT TTTGGGCGAG GGGGCCGGCG ACGCCGTCGG CGGCTCTCTG
CCGCCGGGAA AGACCCTTTA TCTTTTTTTC AAAGCGCAGG TCGAATGA
 
Protein sequence
MTLVHLNKYG ARVKRPSFLL AQLCCALLFL LLMGKEAFAA YQADLMVRLA NEGDSSYAGA 
GIFETTAVIQ SKSQGSYSGY PAQFRVQVKN AGDQTDSFVL TGPAAGSGFT VSYRDQGGVE
RAAQFASGGY RTQSLAPGAS VVLLVQVTLS RFTPGASYRV PVTAVSAGDP AGADQVKTET
VACGLAAAVT VSAPPDGSGA PGSLVLYPYT VTNVGNAVNS FALSLEGGAP WPGILYADDG
AGGGIAGDGV RQPGEENRCV STGPLPPGAS HRFFLAVAIP ESGSDGARAD ARLTVTGEGA
SGNDQVTTTA LAAVLSLVDG VRNLTKGGIF ASAVDAVPGD LLQYRMAITN SGSAPAKAVR
VESPLPAGLK LTPDSMVVTL AADGEGAPCP AAQCGRAWGG EGNIVALLGE GAGDAVGGSL
PPGKTLYLFF KAQVE