Gene GM21_0723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0723 
Symbol 
ID8136038 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp867654 
End bp868700 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content66% 
IMG OID644868340 
Productglycosyl transferase family 2 
Protein accessionYP_003020555 
Protein GI253699366 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.00000000000000115324 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGTCCTTTG ACGAGCGCCC CGAGCTGTCG GTCGTCGTGC CGGTGTACAA CGAGGAGGGG 
AACCTGCTTT CCTTCCTGGA GGCGATGGCG AGGCAGCGCG AGTTGCACCT GGAGGTGATC
ATCTCCGACG GGGGGTCCAG CGACGGCGGC ATCGGGGTTG CCCGCGGCTT CGCGGCTGAC
GCCCCCTTTG CGGTGACGAT AATCGAGGGG GCCAAGGGAA GGGGAGCACA GTTGAATCTA
GGGGCGGACG CGGCGCGCGC CCCCCTTTTG CTCTTTCTCC ACGTGGATTC AAGGTTCGAC
GACCCGCTGG CGCTCAGAAA GGCCGTGGAC GCGCTCGAAA AAGCGCGCCG GGAGGATAGA
AGGGTCGCCG GCCGCTTCTC CCTCCGCTTC GATTTCGAGG GGGCCGCCCC GCTCCCCTAC
CGCTTCTACG GTGCCAAGGC GACCTTGGAC CGCCCCGGAT GCACCCACGG CGACCAGGGG
TTCATGATGG GGAGCGACTT CTTCAACGAG CTCGGCGCCT TCCAGAGCGC GCTCCCGCTC
ATGGAGGACA CCTTCCTCGC CGAGAGGGTG AGGGAAAAGG GGAGCTGGAT CCTCTTCCCA
GCCCGCATCG CCACCTCGTC CCGCCGCTTC CTCACGGAGG GGCTCCTCCC CCGGCAGAGC
CTGAACGCCA TCCTGATGAA CCTGGCCACA CTCGGGCACC TCTCGCTGAT CGAATCCCTG
CGGGAAAGCT ATCGCAGCCA CGACGCGGCG AAGCGTCTGG AGCTGCGCCC CATTCTGCAC
CCCCTCAACC TCAAGATGGC GCAACTGCCG CGCCGGGAGC GGTGGCGGCT TTGGTACCGG
ACCGGGAGCT ACGTGAGGAG CAACGCCTGG CAGATCGCCT TTTTTCTGGA TGTGGTGACG
GGGGGGGCGG GGGAAGGGAA GGGGGGAAGA TTTCTCTCGC TGCACGACCG CCTTTTGGGG
CGGCTCATCG ACAACAGGGC CGGCAACTGC GCTGCGGCCC TTTTCACCTG GTTCTGGTTT
CGGACAACCT TGCGCCTTTG CCGCTAG
 
Protein sequence
MSFDERPELS VVVPVYNEEG NLLSFLEAMA RQRELHLEVI ISDGGSSDGG IGVARGFAAD 
APFAVTIIEG AKGRGAQLNL GADAARAPLL LFLHVDSRFD DPLALRKAVD ALEKARREDR
RVAGRFSLRF DFEGAAPLPY RFYGAKATLD RPGCTHGDQG FMMGSDFFNE LGAFQSALPL
MEDTFLAERV REKGSWILFP ARIATSSRRF LTEGLLPRQS LNAILMNLAT LGHLSLIESL
RESYRSHDAA KRLELRPILH PLNLKMAQLP RRERWRLWYR TGSYVRSNAW QIAFFLDVVT
GGAGEGKGGR FLSLHDRLLG RLIDNRAGNC AAALFTWFWF RTTLRLCR