Gene GM21_1777 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1777 
Symbol 
ID8137108 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2066287 
End bp2067267 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content67% 
IMG OID644869389 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_003021589 
Protein GI253700400 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4608] ABC-type oligopeptide transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.00000396246 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACAGCGC CCCTGTTGCA GGCCGAAAAA CTGGTCAAGC GCTTTGCCGT GCGCGGCGGT 
TTCCTGGCGG AGAAAAGGGA GTTGACCGCC GTAGCCGGCG TCGACCTCGA GATCTTCCCG
GGCGAGACGC TCGGCGTCGC CGGGGAGTCG GGCTGCGGCA AGTCCACCGT GGCGAGGCTT
CTGACCGGGC TCGTCCCCCC TAGCGAGGGG TCGATCCGCT ACGGCGGCCG CGAACTCTCA
GCCATGAACA GGGGGGAGCT CGCCCAGTTC CGCCGCGAAG TGCAGATGAT CTTCCAGGAC
CCCTTCTCCT CGCTGAACCC GAGGATGCGC GTGGCCCAGA TCGTCGGGGA GCCGCTCGAG
ATCCACGGCA TCGGGAGCCC CGCCGAGCGG CGCGAGCGGG TGGCCCGCCT GATGGAACGG
GTGGGGCTTT CCCCGGAGCA GCTCTCGCGC TTTCCGCACC AGTTCTCCGG CGGCCAGCGC
CAGCGCATCG GGATAGCGCG CGCTCTCGCG GTCTCCCCCC GGCTCATCAT CGCCGACGAG
CCGGTTTCGG CGCTCGACCT CTCGATCCAG GCCCAGATCA TCAACCTGCT CCAGGAAGTG
AAAATGGACC TGGGGCTGTC GTTTCTCTTC ATCACCCACG ACCTCTCGGT GTTGAGGCAC
CTAAGCGACC GGATCGCCAT CATGTACCTG GGACGGATCG TCGAGTCCGG GAGCCGGGAC
GACGTACTGT CGAGGCAACT GCACCCGTAC ACGGAGGCGC TTTTAAGCGC CATACCGAGC
ATCGACCCGC GGGAAAAAAG CAGGCACGTC GTAGCGCGCG GGGAACTCCC CTCCCCGCTC
TCCCCCCCCC CAGGATGCCC CTTCCATACC CGCTGCCCCT ACGCGGAGGC GATCTGCGGC
GAGGAGCGCC CCGAGCTTTT GGAGAAGGAA CCCGGCCACT TGGCCGCCTG CCACTTCAGC
AAAAGGATCT ACCGCTCCTA G
 
Protein sequence
MTAPLLQAEK LVKRFAVRGG FLAEKRELTA VAGVDLEIFP GETLGVAGES GCGKSTVARL 
LTGLVPPSEG SIRYGGRELS AMNRGELAQF RREVQMIFQD PFSSLNPRMR VAQIVGEPLE
IHGIGSPAER RERVARLMER VGLSPEQLSR FPHQFSGGQR QRIGIARALA VSPRLIIADE
PVSALDLSIQ AQIINLLQEV KMDLGLSFLF ITHDLSVLRH LSDRIAIMYL GRIVESGSRD
DVLSRQLHPY TEALLSAIPS IDPREKSRHV VARGELPSPL SPPPGCPFHT RCPYAEAICG
EERPELLEKE PGHLAACHFS KRIYRS