Gene Avin_44840 details


Gene Information

Locus tag: Avin_44840
Symbol: waaG
ID: 7763355
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 4539990
End bp: 4541114
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 68%
IMG OID: 643807335
Product: Lipopolysaccharide core biosynthesis protein
Protein accession: YP_002801576
Protein GI: 226946503
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
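
The start, end, and length fields above are consistent with each other, which a short check can show. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4539990
end_bp = 4541114

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1125 374
```

Both values match the Gene Length (1125 bp) and Protein Length (374 aa) fields listed in the record.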


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGCTGG CCTTCATCCT CTACAAGTAC TTTCCCTTCG GCGGGCTGCA GCGCGACTTC 
ATGCGCATCG CCCTGGAATG CCAGAAGCGC GGACACGCCA TCCGTGTCTA CAGCATGTCC
TGGGAGGGCG AGACCCCGCC GGGCTTCGAG GTGCTGATCG CGCCGATCAG GGCGTTGTTC
AACCACCGCC GCAACGAGAA GTTCACCGCC TGGGTGCGGG CCGACCTGGC GCGGCGTCCC
GTGGACCGGG TGGTCGGCTT CAACAAGATG CCCGGGCTGG ACGTCTACTA CGCCGCCGAC
CCCTGCTACG AGGACAAGGC GCAGACCCTG CGCAACCCGC TCTATCGCCT GTGGGGGCGC
TACCGGCATT TCGCCGGCTA CGAGCGCGCG GTGTTCGCGC CGCAGGCGAA GACACGCATA
TTGATGATCT CCGAAGTGCA GCAACCATTG TTCGTCAAGC ATTACGGCAC GCCGGCGGAG
CGTTTCCACC TGCTGCCTCC GGGCATCTCC GCCGACCGCC GCGCGCCGCC CGATGCGGAC
GTGATCCGCG CCGACTTCCG CCGCGAGTTC GGCCTGGCCG GGGACGACCT GCTGCTGGTG
CAGATCGGCT CCGGCTTCAA GACCAAGGGG CTGGACCGCA GCCTCAAGGC GCTCGCCGCG
CTGCCCGGCG CCCTGAAGAA GCGCACCCGG CTGATCGCCA TCGGCCAGGA CGATCCGCGC
CCCTTCCAGT TGCAGATCAA GGCGCTCGGC CTGTCCGGGC GGGTCGAGAT TCTCAAGGGG
CGCAGCGACA TCCCGCGCTT TCTGCTCGGC GCCGATCTGC TGATCCACCC GGCCTACAAC
GAGAACACCG GTACCGTGCT GCTCGAGGCG CTGGTCGCCG GACTGCCGGT GCTGGTCACC
GATGTCTGCG GCTATGCCCA CTACATCGCC GAGGCGGGTT GCGGGCAGGT GCTGCCCAGC
CCCTTCGAGC AGGAGCGCCT GAACCGGACG CTCGCCGCGA TGCTCGAAGA CGATGGGCAG
CGAGCGCTCT ATCGGCGCAA CGGCCTGGCC TACGCCGGAA CCGCCGACCT CTATTCCATG
CCGCAGCGGG CCGCCGATCT GATCCTGACG GAGCAGGGCG CGTGA
 
Protein sequence
MQLAFILYKY FPFGGLQRDF MRIALECQKR GHAIRVYSMS WEGETPPGFE VLIAPIRALF 
NHRRNEKFTA WVRADLARRP VDRVVGFNKM PGLDVYYAAD PCYEDKAQTL RNPLYRLWGR
YRHFAGYERA VFAPQAKTRI LMISEVQQPL FVKHYGTPAE RFHLLPPGIS ADRRAPPDAD
VIRADFRREF GLAGDDLLLV QIGSGFKTKG LDRSLKALAA LPGALKKRTR LIAIGQDDPR
PFQLQIKALG LSGRVEILKG RSDIPRFLLG ADLLIHPAYN ENTGTVLLEA LVAGLPVLVT
DVCGYAHYIA EAGCGQVLPS PFEQERLNRT LAAMLEDDGQ RALYRRNGLA YAGTADLYSM
PQRAADLILT EQGA
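
The sequences above are laid out in blocks of 10 characters, 60 per full line, so the spacing has to be removed before any analysis. The following is a minimal sketch of that cleanup, together with a simple GC-fraction helper of the kind that could be used to check the GC content field. The function names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGCAGCTGG CCTTCATCCT CTACAAGTAC TTTCCCTTCG GCGGGCTGCA GCGCGACTTC"
last_line = "CCGCAGCGGG CCGCCGATCT GATCCTGACG GAGCAGGGCG CGTGA"

fragment = clean_sequence(first_line)
print(len(fragment))                              # 60 bases in one full line
print(fragment.startswith("ATG"))                 # True: begins with a start codon
print(clean_sequence(last_line).endswith("TGA"))  # True: ends with a stop codon
print(round(gc_fraction(fragment), 2))            # GC fraction of the first line only
```

Passing the complete gene sequence through the same two functions would give the gene length and the overall GC fraction for comparison with the fields in the Gene Information section.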