Gene Avin_44850 details


Gene Information

Locus tag: Avin_44850
Symbol:
ID: 7763356
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 4541185
End bp: 4542249
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 73%
IMG OID: 643807336
Product: lipopolysaccharide heptosyltransferase I, waaC
Protein accession: YP_002801577
Protein GI: 226946504
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0859] ADP-heptose:LPS heptosyltransferase
TIGRFAM ID: [TIGR02193] lipopolysaccharide heptosyltransferase I
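
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(4541185, 4542249)
print(length_bp)                  # 1065
print(protein_length(length_bp))  # 354
```

Both values agree with the Gene Length and Protein Length fields of the record.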


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.721697
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCGCGTAC TGCTGATCAA GACCTCCTCG CTGGGCGATG TCATCCATAC CCTGCCGGCC 
CTGACCGACG CGGCGCGGGC GCTGCCCGGC ATCCGTTTCG ACTGGGTGGT GGAGGAGGGC
TTCGCCGAGA TTCCCGCCTG GCATCCGGCC GTGGAGCGGG TGATTCCCGT GGCCATCCGC
CGCTGGCGCC GGAGTCCCTG GCAGGCGACC ACCCGTGACG AATGGCGGCG CTTTCGCCAG
ACCCTGGGAG AGGGCCGCTA CGACCTGGTG ATCGATGCCC AGGGGCTGTT GAAAAGCGCC
TGGCTGACCC GCTTCGGCGG CGCGCCGGTG GCCGGGCTGG ATCGCCGTTC GGCGCGCGAG
CCGCTGGCCA GTCTCCTCTA TGGGCGGCGC TATCCCGTGC CTTGGGGACA GCACGCGGTG
GAGCGGGTGC GCCAGTTGTT CGCCCAGGCG CTGGGCTATC CGCCGCCGAC GGCGGTCGGC
GACTACGGAC TGGACCGCCA CCGCTTGGCC GTGCCGGACG GCGCGCCCTA CCTGCTGTTC
CTGCACGGCA CCACCTGGGA CAGCAAGCAC TGGCCGGAAA GCTACTGGCG CGAGCTGGCC
GAACGCATGG GCAGTGCGGG CTGGGCGGTG CGCCTGCCCT GGGGCAATGC GGTGGAGCGG
GACCGCGCCG GGCGCATCGC CGAGGGGCTG GCGTCGGTCG AGGTGCTGCC CCGGATCAAC
CTCGCCGGCA TCGCCGGGAT TCTCGCCGGG GCCAGCGCCT GCGTGGCGGT GGACACCGGC
CTCGGCCACC TGGCGGCGGC GCTGGATGTG CCGACTGTCT CCCTCTACGG CCCGACCGAT
CCGCGCCTGA CCGGCGCCTA CGGTCGTCAC CAGCGCCGCC TGACCAGCGA CTACCCGGCC
TGCGTGCCCT GCCTGCGCAA GACCTGCGGC TACCGGCCGA CCGAGGAGGA CCGCCGCCGG
CTGGATTTGA GCCGCGAGCA GCCGGTGTGC TTCAGTCGCA TCGATCCGCG GCGGGTGGCC
GGCGCCTTGC AGGCGCTGCT GGACGAGGCG GCCTGCCGAT GCTGA
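
A GC content figure such as the one reported for this gene can be recomputed from the sequence text. The following is a minimal sketch that assumes the sequence is pasted in the 10-base blocks shown here and that GC content is the plain percentage of G and C bases; how the source database rounds the value is not stated in this record. The helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, copied from this record.
first_line = "GTGCGCGTAC TGCTGATCAA GACCTCCTCG CTGGGCGATG TCATCCATAC CCTGCCGGCC"
print(round(gc_content(first_line), 1))
```

Passing the full 1065 bp sequence instead of `first_line` gives the figure to compare with the GC content field.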
 
Protein sequence
MRVLLIKTSS LGDVIHTLPA LTDAARALPG IRFDWVVEEG FAEIPAWHPA VERVIPVAIR 
RWRRSPWQAT TRDEWRRFRQ TLGEGRYDLV IDAQGLLKSA WLTRFGGAPV AGLDRRSARE
PLASLLYGRR YPVPWGQHAV ERVRQLFAQA LGYPPPTAVG DYGLDRHRLA VPDGAPYLLF
LHGTTWDSKH WPESYWRELA ERMGSAGWAV RLPWGNAVER DRAGRIAEGL ASVEVLPRIN
LAGIAGILAG ASACVAVDTG LGHLAAALDV PTVSLYGPTD PRLTGAYGRH QRRLTSDYPA
CVPCLRKTCG YRPTEEDRRR LDLSREQPVC FSRIDPRRVA GALQALLDEA ACRC
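
The relationship between the gene sequence and the protein sequence can be illustrated with a small translator. This is a sketch under stated assumptions: it uses the standard bacterial codon assignments (translation table 11 shares its sense codons with the standard code), and it treats ATG, GTG and TTG as initiator codons that are read as methionine in the first position. Table 11 lists further rare initiators that are left out here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Map each codon (TTT, TTC, ...) to its one-letter amino acid; '*' is a stop.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, start_codons=("ATG", "GTG", "TTG")) -> str:
    """Translate a CDS, reading an initiator codon in first position as M."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in start_codons:
            aa = "M"  # alternative start codons still initiate with Met
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGCGCGTAC TGCTGATCAA GACCTCCTCG"))  # MRVLLIKTSS
```

The output matches the first ten residues of the protein sequence, which shows that the GTG at the start of the gene is read as the initiating methionine.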