Gene GM21_0736 details


Gene Information

Locus tag: GM21_0736
Symbol:
ID: 8136051
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 879255
End bp: 880253
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 61%
IMG OID: 644868353
Product: protein of unknown function DUF534
Protein accession: YP_003020568
Protein GI: 253699379
COG category: [R] General function prediction only
COG ID: [COG2984] ABC-type uncharacterized transport system, periplasmic component
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.00000000204583
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAGCAAGA ATTTTCGTAG CGTGATGCTC TGTTTCGCTT TGTCCCTGGT CTGCGCCGCA 
ACGGCTTTCG CGGCGGCCCC CTCCAAGCCG GTGCTGATCG GCATCTCGAA GATAGTTTCC
CATCCGGCGC TCGACTCGGT GGTCAAGGGG GTTCAGGACG AATTGAAGGA CGCCAGGGTC
AACGCGATCT TCGACGTGCA AAACGCCAAC GGCGACATCA ACACCGCGGC CTCCATCGCC
AACAAGTTCC GGTCCCAGAA GGTGAACCTC GCCGTCGGCG TCGCCACTCC GACCGCCCAG
GCCCTGGTTA ATACGCTCAA GGGGATCCCC ATCGTCTACT CCGCGGTCAC CGATCCGGTG
AAGGCGGGCC TCGTTCCCTC CCTCGCCAAG GGGGGCAAGA ACGTAACCGG CGTATCCGAC
ATGACTCCGG TCCGGCAGCA GATCGAGATG CTGCTCAGGA TCAAGCCCAA GACCAAGCGC
ATCGGCCACA TCTACACGAG CTCCGAGGAG AACGCCGTGG TTCTTGCCGC AATGGTGAAG
CAGGTGTGCA AAGAGAAGAA GCTCGAATTC GTGGAGACCA CCGTCACCAA GTCGGCAGAG
GTGAAGCAGG CGGCCCAGGC GATCGCGCAC CGCGTCGACG CCTTTTACAT CAGCACCGAC
AACACCGTGG TCTCCGCCAT GAGCGCGGTG GCGGATGTGG CGAAAAAGGC GAAGATCCCC
ATCATGTCCG CCGACCCGAG CTCCTCCGAG ACCTATGACG TCCTCGCCGC CTGGGGCTTC
GACTACTACA AGATGGGGCG CGCCACCGGC AAGGTTGTGA TCGAGATCCT GAAGGGCAAG
AAGCCCGAGC AGATCCCGAC CCGCTTCATG ACCAAGGCCT CCGACGTCGA CCTGCTGATC
AACCTCGACG TGGCCAAGAA GCTCGGCCTC ACCGTCCCGG CGGACATCGT GAAGAGCGCG
AAGACCATAC GCCAGAACGG CAAATTGACC AAGAAGTAA
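
The Gene Length and GC content values in this record can be checked against the sequence block itself. The following is a minimal Python sketch, assuming the sequence is pasted as shown here (bases in space-separated groups of 10). The helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of any database API.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, formatted as in the record.
first_line = "GTGAGCAAGA ATTTTCGTAG CGTGATGCTC TGTTTCGCTT TGTCCCTGGT CTGCGCCGCA"
seq = clean_sequence(first_line)
print(len(seq))  # 60 (six groups of 10 bases)
print(f"{gc_fraction(seq):.0%}")
```

Running both helpers on the full sequence block gives values that can be compared with the Gene Length and GC content fields listed under Gene Information.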
 
Protein sequence
MSKNFRSVML CFALSLVCAA TAFAAAPSKP VLIGISKIVS HPALDSVVKG VQDELKDARV 
NAIFDVQNAN GDINTAASIA NKFRSQKVNL AVGVATPTAQ ALVNTLKGIP IVYSAVTDPV
KAGLVPSLAK GGKNVTGVSD MTPVRQQIEM LLRIKPKTKR IGHIYTSSEE NAVVLAAMVK
QVCKEKKLEF VETTVTKSAE VKQAAQAIAH RVDAFYISTD NTVVSAMSAV ADVAKKAKIP
IMSADPSSSE TYDVLAAWGF DYYKMGRATG KVVIEILKGK KPEQIPTRFM TKASDVDLLI
NLDVAKKLGL TVPADIVKSA KTIRQNGKLT KK
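
The protein sequence is what translating the gene sequence with translation table 11 yields: the gene begins with GTG, which table 11 allows as an initiation codon read as methionine, and ends with the stop codon TAA. The following is a minimal Python sketch of that translation, not the pipeline that produced this record. The function name `translate_cds` is hypothetical, and unrecognized codons are mapped to `X` as an assumption of this sketch.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(raw: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop the 10-base grouping
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An alternative start codon is read as methionine.
            protein.append("M")
            continue
        aa = CODON_TABLE.get(codon, "X")  # assumption: X for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGAGCAAGA ATTTTCGTAG CGTGATGCTC"))  # MSKNFRSVML
```

The output matches the first ten residues of the protein sequence above. A 999 bp coding sequence holds 333 codons, and the final one is the stop codon, which accounts for the 332 aa protein length.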