Gene Gbem_2041 details

Gene Information

Locus tag: Gbem_2041
Symbol:
ID: 6782035
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter bemidjiensis Bem
Kingdom: Bacteria
Replicon accession: NC_011146
Strand:
Start bp: 2348170
End bp: 2349180
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 62%
IMG OID: 642768036
Product: transglutaminase domain protein
Protein accession: YP_002138850
Protein GI: 197118423
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
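
The start, end, gene length, and protein length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive (which matches the listed 1011 bp) and that the final codon of the CDS is a stop codon; the function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2348170, 2349180)
print(length)                     # 1011
print(protein_length_aa(length))  # 336
```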


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAAAAGGT TGTTGCTGTC GATGCTTGCA CTCTCACTCT GCGTTACTCC GGCTGCCTGG 
GCTAAAAGCC GCAGCGGTGT CGTCACGGTG GAAGTGGATC TTTCCAAACA AGAGCAGGGC
AAGGAGACGA AACTCTGGAT TCCGTACGCG GTTTCCGGGA AACACCAGGC AGTCACGGAC
GTCAAGGTGA GCGGCGATTT CGCCACCTCG GCGGTCTACA CCGACAAGGC CAACGGCACC
CCCATCCTCT TCGCCCAGTG GGGCAAGGAT GCGGCCAGCC GCAAGCTCAC CTACTCCTTC
TCCGTGGAAC GCGAGGAGTT ATTGGTGCGA GACCTTTCCG CCAAGGAGAC ATCCTGGAGC
AAGGAGGAGT TCGCCCCCTA CCTGCAGTCG ACCTCCATGG GCCCGGTCGA CGGCGAGGTG
AAAAAGCTCT CCGATTCCAT CACCAAGGGT AAGCACACGG TGCTGGAGAA GGCGAAGGCG
ATCTATGACT GGACCTGCGA GAACATGTAC CGCGATCCGG CCACCGTCGG CTGCGGCAAG
GGGAATGTCT GCGAACTGCT GAAAAAGCCC GGCGGCAAGT GCACCGACAT CTCGTCGGTC
TACGTCGCCC TGGCGCGCGC TGCCGGCGTT CCCTCCCGCG AGGTCTTCGG GGTGAGGCTG
GGCAAAAAAG CGACGGAGGA CATCACCTCC TGGCAGCACT GCTGGGTCGA ATTCTACCTC
CCCGGCACCG GCTGGGTCCC GGTCGACCCG GCCGACGTGA GAAAGGCGAT GCTGGTCGAG
AAGCTCGATC CGAAGGATGC GAAGACCCGC GAGTATCGGG ACTACTTCTG GGGCGGGATC
GACCCGTACC GCTTCCAGGT CGCTACCGGC CGCGATATCG TCCTGAACCC GCCGCAGGCA
GGCGCTCCGC TCAACACCTT CGGCTACCCT TATGCAGAGG TAGGCGGTAC TGCGCTTGAC
TTCTACGATC CCAAGAGCTT CAGCTACCGG ATCACCTATA AGGAGCAGTA G
 
Protein sequence
MKRLLLSMLA LSLCVTPAAW AKSRSGVVTV EVDLSKQEQG KETKLWIPYA VSGKHQAVTD 
VKVSGDFATS AVYTDKANGT PILFAQWGKD AASRKLTYSF SVEREELLVR DLSAKETSWS
KEEFAPYLQS TSMGPVDGEV KKLSDSITKG KHTVLEKAKA IYDWTCENMY RDPATVGCGK
GNVCELLKKP GGKCTDISSV YVALARAAGV PSREVFGVRL GKKATEDITS WQHCWVEFYL
PGTGWVPVDP ADVRKAMLVE KLDPKDAKTR EYRDYFWGGI DPYRFQVATG RDIVLNPPQA
GAPLNTFGYP YAEVGGTALD FYDPKSFSYR ITYKEQ
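
The relationship between the gene sequence and the protein sequence can be reproduced with a minimal translation sketch. It is not the tool that produced this record. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the tables differ in which start codons are permitted), and the helper names `clean`, `translate`, and `gc_content` are hypothetical. The sequence blocks above are space-separated, so whitespace is stripped first.

```python
from itertools import product

# Codon-to-amino-acid assignments in the usual T, C, A, G ordering.
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocked layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 nt and last 21 nt of the gene sequence listed above.
print(translate("ATGAAAAGGT TGTTGCTGTC GATGCTTGCA"))  # MKRLLLSMLA
print(translate("ATCACCTATA AGGAGCAGTA G"))           # ITYKEQ
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the listed protein sequence and GC content fields.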