Gene GM21_3508 details


Gene Information

Locus tag: GM21_3508
Symbol:
ID: 8138880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4049211
End bp: 4050350
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 57%
IMG OID: 644871127
Product: glycosyl transferase group 1
Protein accession: YP_003023287
Protein GI: 253702098
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
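
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a protein length that excludes the terminal stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4049211, 4050350)
print(length)                  # 1140
print(protein_length(length))  # 379
```

Both values match the Gene Length (1140 bp) and Protein Length (379 aa) fields of this record.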


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 1.8743499999999998e-20
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTGCTTC ACAGGCTTTT GAGCGGCTTG GCGTCTACCA GCTACTGTCT TATCTCAAGT 
GGCCGAAAAC ACGATGCCGG GGATGCCGCC TGTGAGCGCT TGGATGCACC GTACTTCTAT
CTGCCAAAGG TGAGGCAGTT GCCGCCGGTA GCGATCCCCG GCCTTTCTGC CCTGTTCGTG
TGTATCAACC TGTTGTGGGT GGTGTTGAGG CGATCCCGGC AGATAGAAGA TATGGCGAGG
CGCGAAGGAT GCGAGGCTAT CGTGGCGTGC ACAGGCGATT TTTATGATCT CCCTGCAGCG
TTTCTGGCCT GCAGGCGCAT GAAGATACCA TTCGTTCCCT ACATCTTCGA CGACTACGGT
TACCAGTGGC TCGGGTTCAG GCGCAGTATC GCCAAGCGGT TGGAGCGTGT CCTGCTCTCC
TTTGCAGCGG CGGTCATTGT ACCGAACGAA TACCTGCAGA GAGAGTACGC AACCAGACAC
GGCATCGACA GCACTGTCAT CCATAATCCC TGCTCTTTGC CGGACCTGGA GCGCCTGGAC
CAGGGGCCGA AGATGTTCGG GGAGGGGGTG AACATCGTCT ATACGGGGTC GATTTACCAC
GCCCATTACG ATGCCTTCGC AAACCTCATC GCTGCTCTCA GGCTTTTGGG CAGGCCGGAG
GTGAAACTGC ACCTGTTCAC CGCACAGTCT GAAAGAGAGT TGGCTGGACA GGGGATAGGT
GGGCCGCAGG TTGTCCACCA CCCTCACGTT CCACAGCGCG AGGTGGAGCG TATCCTGCGG
CAGGCGGACC TGCTTTTTCT CCCGTTGGCG TTTCGTTCCC CGATACCCGA GGTGATAAGA
ACTTCGGCGC CAGGCAAAAT GGGTGAATAT CTCGCTGTGG GGCGTCCTGT GCTCGTTCAT
GCCCTGCCCG ATTCCTTCAT CGCCTGGTAC TTCCGGGCGA ATGGATGCGG GATAGTCGTA
GATCAGCATG ATGCAGGAGT TCTGTCACAG GCAATCGAGG CACTGTTATC GGATCCGCAG
GCGCTGACAG ATATGGGACT TAAAGCAAGA AAGAGAGCTC AAGTTGATTT CGACGTTACC
GTCGTCCGTT CGCAATTCCT TGCTTTACTA AAACGGATCG GCTTTAGGTG TGGTGCATGA
 
Protein sequence
MVLHRLLSGL ASTSYCLISS GRKHDAGDAA CERLDAPYFY LPKVRQLPPV AIPGLSALFV 
CINLLWVVLR RSRQIEDMAR REGCEAIVAC TGDFYDLPAA FLACRRMKIP FVPYIFDDYG
YQWLGFRRSI AKRLERVLLS FAAAVIVPNE YLQREYATRH GIDSTVIHNP CSLPDLERLD
QGPKMFGEGV NIVYTGSIYH AHYDAFANLI AALRLLGRPE VKLHLFTAQS ERELAGQGIG
GPQVVHHPHV PQREVERILR QADLLFLPLA FRSPIPEVIR TSAPGKMGEY LAVGRPVLVH
ALPDSFIAWY FRANGCGIVV DQHDAGVLSQ AIEALLSDPQ ALTDMGLKAR KRAQVDFDVT
VVRSQFLALL KRIGFRCGA
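
The sequences above are displayed in blocks of 10 residues separated by spaces. A minimal sketch of how such a display block could be joined back into a contiguous string before further analysis is shown below, using the first and last lines of the gene sequence as sample input. The helper name `normalize_sequence` is hypothetical and introduced only for illustration.

```python
def normalize_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


# First and last display lines of the gene sequence from this record.
first_line = "ATGGTGCTTC ACAGGCTTTT GAGCGGCTTG GCGTCTACCA GCTACTGTCT TATCTCAAGT"
last_line = "GTCGTCCGTT CGCAATTCCT TGCTTTACTA AAACGGATCG GCTTTAGGTG TGGTGCATGA"

first = normalize_sequence(first_line)
last = normalize_sequence(last_line)

print(len(first), first[:3])   # 60 ATG  (start codon)
print(len(last), last[-3:])    # 60 TGA  (stop codon)
```

Each display line holds 60 bases, so the 19 lines of the gene sequence account for the 1140 bp reported in the record.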