Gene GM21_3406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3406 
Symbol 
ID8138773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3937315 
End bp3938328 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content64% 
IMG OID644871023 
Productglycosyl transferase family 9 
Protein accessionYP_003023188 
Protein GI253701999 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0859] ADP-heptose:LPS heptosyltransferase 
TIGRFAM ID[TIGR02195] lipopolysaccharide heptosyltransferase II 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones132 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCTA AGGTCACAAA CATCCTCATC ATCAAGCCGG GCGCCATCGG CGACCTGTTG 
CACATGACCC CCGTGGTGCG TGCCCTGAAG GGGATCTATC CCGCGGCCTC CATCACCATC
ATGGTCAGCT CCCGCGTCAC CGCGCTTTTA TTCGCCGACA ACCCGATGGT GGACGAGGTG
GTGATCTTCG ACAAGAAGGG CGAACAGAAG AGCTGGGGCG GGGTCTTCAA GCTCTGGAAG
CGGCTCAGGC CCAAGCGTTT CGACCTGGTG CTCAACTACC AGCGCAGCAA CCTGAAGGGT
TGGGCGCTGG TGACGGCGGC GATTCCGTGT CGCGTGCTGG TCTACCACAA GACCCGCGGC
AGGGTGATCC ACGCCATCGT CGACCACCTG CGTCCACTGG CCTGCCTCGG GGTGGACCCG
GAGCGGGCGG ACCGCTCCCT CGATTTCTTT CCGTCTCAGG CGGACACCGA CTACGCCGAG
CGCTTCGTCC GGGAGAACGG TCTGGCCGGC AGGCGCCTGG TCGCCTTCAA CCCCGGGACC
AGCAGCGAGA ACAAGTGTTG GCCCATCGAG CGCTACGCGG AACTGGGGGA CCGGTTGGCC
GCCCGCGGCG TTGCCGTGGT GGTGGTCGGG AGCCGGGACG AGGCTCCGCT TGCGGCGGCG
ATACGCGCCG GGATGAAGGA AGAGGTGTAC GATCTGTGCG GCTGTTCGTT GGGTGAGCTT
GCCGCTTTGC TTAAGCACTG CGAATTCCTG GTCACCGGCG ACACCGGTCC CATGCACATA
GCGGCAGCGG TCGGCACCAG GAACCTGGCG CTCTACGGCC CGATCAGCCC GGTCAGAAGC
GGCCCGGTTG GCGAGGGGCA CCGGATCGTC ATCCACGACG AACTGGACTG CTGCCCCTGC
AACAGTTTCA AGTGCAGCAA CAAGGAGTTC CGGCTCTGCA TGGAGAGAAT CACGGTCGAC
GAGGCGGACA AGGTAGCGGC GGAAATGTTG GCAGTCAAAC GAGAGGTGAA ATGA
 
Protein sequence
MNAKVTNILI IKPGAIGDLL HMTPVVRALK GIYPAASITI MVSSRVTALL FADNPMVDEV 
VIFDKKGEQK SWGGVFKLWK RLRPKRFDLV LNYQRSNLKG WALVTAAIPC RVLVYHKTRG
RVIHAIVDHL RPLACLGVDP ERADRSLDFF PSQADTDYAE RFVRENGLAG RRLVAFNPGT
SSENKCWPIE RYAELGDRLA ARGVAVVVVG SRDEAPLAAA IRAGMKEEVY DLCGCSLGEL
AALLKHCEFL VTGDTGPMHI AAAVGTRNLA LYGPISPVRS GPVGEGHRIV IHDELDCCPC
NSFKCSNKEF RLCMERITVD EADKVAAEML AVKREVK