Gene GM21_1508 details

Gene Information

Locus tag: GM21_1508
Symbol:
ID: 8136837
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1763026
End bp: 1764096
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 60%
IMG OID: 644869120
Product: Mammalian cell entry related domain protein
Protein accession: YP_003021322
Protein GI: 253700133
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component
TIGRFAM ID:
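
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Number of amino acids, assuming one trailing stop codon is not translated."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values taken from the record above.
length = gene_length(1763026, 1764096)
print(length)                  # 1071
print(protein_length(length))  # 356
```

Both results agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields.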


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 74
Fosmid unclonability p-value: 0.577537
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCTTT CGGTAGAAAA AAAGGTAGGC ATGTTCTTCC TGTTCGGTCT CATCATCCTG 
GGGGTGCTCC TTGAGGTGGG TGAGAAGTGG AACCCGTTCG AGAAGAACGT CCCCTATAAG
ACCTATCTCA CCAGCATCAC CGGGTTGAAG GTAGGCGACC CGGTGCGGCT TGCCGGCGTA
GACGTGGGGC GCATCAAGGG AATCGACATC CTGAACGACA AGATCGAGAT CAACTTCGAG
GTGAAGCCCG GTACCAGGAT CAAGACGGAC ACGGTTGCGG GACTGCGGCT CACGAACCTT
CTGGGCGGCC AGTTCCTGGG GCTTTCCTTC GGATCCCCCA ACGCGCCGCT TCTGGAGCCG
GGCGGGGTGG TGAAAGGGAA GGACGTCGCC AACATCGACA TCATCGTCGA CAACCTGAGC
GACGTGGTGA AGGACGCCAA GGTCTTCGTG AACAACCTGG ACCGCAACCA GGACATAGTA
CTTAAGAAAA TCTCGGTGAT GCTGGACGAG AACCGCGGCA ACCTGAGGGC GTCCATCTCC
AACATCAACA GCATTACCGG CAAGATGGAC CGCGGCGAAG GGTCGCTGGC GCTGCTTTTG
AACGAAAGAA AGCTCTTCGA CGACGCAAGC GGCGCGGTGG AGAGCCTGAA GGTGGTGGCT
GGCAAGATCG AGCGCGGCGA GGGGACTCTC GGGAAGCTGG TCAACGACGA ATCTGTCTAT
CGCGAAGCCT CCGCCCTCAT CACCGACCTC AGGGCGGGCG CGAAGGACCT GAACGCCGGG
ATGAAGGACG TGAAGGAGAT CACGGCCAAG GTCAACCGCG GCGAGGGGAC CCTGGGCAAG
CTGATCAACG ACGAGACGCT CTATGTCGAC CTGCGCGAGG CGTCGAAAAA CGTGAAGGAA
ATCACTCAGA AGATCAACTC CGGTCAGGGG ACCCTGGGAA AACTGGTCAA CGAGGACCAG
CTCTACCGCG ACACCACCGC CACGCTCAAG AAGACCGAGC GGGCCATGGA AGGTCTCGGA
GACGCAGGTC CCATCTCCGT CATCGGCTCC ATCGTGGGGA CCCTCTTCTA G
 
Protein sequence
MALSVEKKVG MFFLFGLIIL GVLLEVGEKW NPFEKNVPYK TYLTSITGLK VGDPVRLAGV 
DVGRIKGIDI LNDKIEINFE VKPGTRIKTD TVAGLRLTNL LGGQFLGLSF GSPNAPLLEP
GGVVKGKDVA NIDIIVDNLS DVVKDAKVFV NNLDRNQDIV LKKISVMLDE NRGNLRASIS
NINSITGKMD RGEGSLALLL NERKLFDDAS GAVESLKVVA GKIERGEGTL GKLVNDESVY
REASALITDL RAGAKDLNAG MKDVKEITAK VNRGEGTLGK LINDETLYVD LREASKNVKE
ITQKINSGQG TLGKLVNEDQ LYRDTTATLK KTERAMEGLG DAGPISVIGS IVGTLF
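
The sequences above are displayed in space-separated blocks of 10 residues. A minimal sketch for collapsing such a display into one contiguous string and computing its GC percentage is given below. The function names are hypothetical, and the example uses only the first displayed row of the gene sequence, so it does not reproduce the GC content reported for the whole gene.

```python
def collapse_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed row of the gene sequence (not the full gene).
first_row = "ATGGCTCTTT CGGTAGAAAA AAAGGTAGGC ATGTTCTTCC TGTTCGGTCT CATCATCCTG"
seq = collapse_blocks(first_row)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Pasting all rows of the gene sequence into `collapse_blocks` should yield a string whose length matches the Gene Length field.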