Gene GM21_3950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3950 
Symbol 
ID8139324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4533715 
End bp4535337 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content64% 
IMG OID644871567 
Productglycosyl transferase family 39 
Protein accessionYP_003023725 
Protein GI253702536 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.00000131348 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCCAAG CGCAGCAAGC TAAGGGGCTC TGGCTCCCTT TCCTGATCCT GGCGGGGACC 
TGCCTCCTCT TCTCGCTGGT CCTCCCCTTC TTCCCCGTGG ACGAGACCCG CTACCTCTCC
GTGGCCTGGG AGATGAGGGT GCACGACTCC TTCATCGTCC CGATCCAGAA CGCGCTCCCC
TATTCGCACA AGCCCCCCCT GCTCTTCTGG CTGATCAACC TGGACTGGCT CCTCTTCGGG
GTCAACGAGG GGACGCTCCG CTTCATCCCG CTCATCTTCA GCCTTTTCAA CGTCTGCCTG
GTCTACCGCA TCGCCCTTGA GCTCTGGGAG GACAGCAAGC TCGCGCTGAA CGCCGCCGTC
ATTCTCGGCT CCACCTTCTC CTATCTTTTG TGGTCCTCGG TGATCATGTT CGACGTGATC
CTCTCCTTCT GGGTCCTCGT AGCCGTTTTC GCCTTCATCC GCGCCGGCAC AAAGATGAGG
TTCGCCGACT TCGCCCTGGC GGGCGCGGCC ATAGGATGCG GCATCCTCAC CAAGGGACCG
GTGGTGCTGG TCTACATCCT GCCGGTGGCG CTCTTCGCCT TCTGGTGGCA GCCCAAAGGC
GAGGTCGCTC CCAAGTGGTA CGGCTTTTTG CTCCTCTGCC TGCTGATAGG TATCGCGGTG
GTCCTCGCCT GGCTCATCCC GGCGGCGCTC ACCGGGGGGG AGGTTTACCG AAAGGCGATC
CTTTGGGGGC AGACGGTGCA CCGCATGGCC AACTCCTTCG CCCACAAAAG GCAGCTTTGG
TGGTATTTCC AGTGGATCCC GGTCCTTCTG GCCCCTTGGA TCTTCTTCGC CCCCTTCTGG
CGCGGCTGCC GCCGCCTGCC CCTGGACGCG GGGACCAGGC TGGTGCTCAC CTGGATCGTT
GCCGGTTTCG TCGTCTTCTC CTTTTTGAGC GGAAAGCAGG TGCACTACCT GATCCCCCTG
ATCCCCGCCT GCAGCCTGCT GATGGCGAAG GCGATCGCCA GCTCCGAGGA GAGCGGGCGG
CGCTTCCAGC TCCCGATCGC CGTCTTCTAC CTGGTCCTCG GCGCGGCCAT CGCGATTATC
ACCTTCCTGA AGCAGGGGCG CGCGCTGCAA AACTTCGATC CCGGCGAGTT GAGAATCGCC
GCCGCCGGCC TCATGCTCCT CGGCGCCGCG CTCTCCTTCC CCAAGCCGCG CGACGCCTCG
GCCGCCCTCA AGCTGGTGGC GCTCTCAGCA CTCCCCTTCT TCGCCCTTGT GGCGGTCGGT
TGCCACACCT TCTCCGGGCG CTACGACCTC CACGCCGTCT CGGCCGCGGT ACTAAAGAAG
CAGCAGGAAG GGTATCAGGT GCTGCACCGG GGAAAATACC ACGGCCAGTA CCATTTCATG
GGGCGGCTGC AACTGCCGCT GCTGCAACTG GAGGATAGCG ACGAGATCCG CCGCTACGCG
CAGACCCGCG AAAAGGTGGC GCTCTTGAGC TATACGCCCG ACGACCAGGC GGTGCAGCCG
GAGGAGGCCT TCTTCCGGCA GCCCTTCAGG AGCAAGCAGG TGGTCCTCTG GAACAGCCGG
GGGATCCTGC AGAACCTGGA CGGCGCCAAG GCCGCCGCGA CCCCGCCCCC AGGGAAACCA
TAA
 
Protein sequence
MSQAQQAKGL WLPFLILAGT CLLFSLVLPF FPVDETRYLS VAWEMRVHDS FIVPIQNALP 
YSHKPPLLFW LINLDWLLFG VNEGTLRFIP LIFSLFNVCL VYRIALELWE DSKLALNAAV
ILGSTFSYLL WSSVIMFDVI LSFWVLVAVF AFIRAGTKMR FADFALAGAA IGCGILTKGP
VVLVYILPVA LFAFWWQPKG EVAPKWYGFL LLCLLIGIAV VLAWLIPAAL TGGEVYRKAI
LWGQTVHRMA NSFAHKRQLW WYFQWIPVLL APWIFFAPFW RGCRRLPLDA GTRLVLTWIV
AGFVVFSFLS GKQVHYLIPL IPACSLLMAK AIASSEESGR RFQLPIAVFY LVLGAAIAII
TFLKQGRALQ NFDPGELRIA AAGLMLLGAA LSFPKPRDAS AALKLVALSA LPFFALVAVG
CHTFSGRYDL HAVSAAVLKK QQEGYQVLHR GKYHGQYHFM GRLQLPLLQL EDSDEIRRYA
QTREKVALLS YTPDDQAVQP EEAFFRQPFR SKQVVLWNSR GILQNLDGAK AAATPPPGKP