Gene Avin_30040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_30040 
Symbol 
ID7761905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3110818 
End bp3111942 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content72% 
IMG OID643805877 
ProductGlycosyl transferase, group 1 family protein 
Protein accessionYP_002800145 
Protein GI226945072 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.060295 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCTTCC TGCTCGTCGC CGGCTTCCCC GACTCGCTCC TGTCGTTTCG CGGCCCGCTG 
CTCGAAGCCC TGCTCGCCCG CGGCCTGGAG GTGCACGTGG CGGCGCCGCA CCTGGCGCCC
GGCTGCCTCC TGCGCCAGCG CCTGGAGGCA CGCGGTCTGC GGGTGCACGA CATTCCCCTG
CGGCGCACCG GCATGAATCC GCTGCAGGAC TGCGCCACCC TGCTGCACCT GTGGCGGCTG
AAGCGGCGCA TCCGCCCGAC CCATGTCCTC GGCTACACCG CCAAGCCGGT GATCTACGGT
TCGCTGGCCG CCGCCTGGGC CGGGGTGCCG CGACGCTTCG CGCTGATCAC CGGGCTGGGC
TACGCCTTCC TCGGCGAGGC GGGGGACGGC GGCGCCCGCG GCCTGCTGCA CGCCCTGCTG
CCGCGCCTCT ACGCGCTGGC GCTGCGGCGA ACCCACAAGG TGTTCTTCCA GAACCCGGAC
GACCAGGCCC TGTTCCGCGG CCAGGGCATC CTCGGCCCGG CGACGCCCTC CTGCGTCATC
AACGGTTCCG GCGTGGACCT GCTCGAATAC CCCGTCGCGC CCGTACCGGC GCGACCGCAC
TTCCTGCTGA TCGCCCGGTT GCTGGGCGAC AAGGGCGTGC GCGAATACGC TGCGGCGGCG
CGCCAGGTGA AGAACCGCTG CCCGGCGGCG CTGTTCAGCC TGGTCGGCTG GATCGACGAC
AACCCCGACG CCATCGGCCA GGCGGAACTG GACGGCTGGC TGGCCGACGG CACGCTGCAC
TACCTCGGCC GCCTGGACGA CGTGCGCCCG GCGATCGCCG CCTGCAGCGT GTACGTCCTG
CCCTCCTACC GCGAAGGCAC GCCGCGCACG GTACTGGAAG CCATGGCCAT GGGCCGCGCG
GTGATCACCA CCGACGCCCC CGGCTGCCGC GAGACGGTGG TGGACGGCGA CAACGGCTTT
CGCGTGCCGG TGAAGGCGGT GGACGAGTTG GCCCGGGCCA TGCAGCGCTT CGTCGAGGAA
CCGGCGCTGG CCGTGCGCAT GGGCGCCCGC TCGCGGCAAC TGGCCGAGGA GAAATATGAT
GTGCAGCGGA TCAACGCCCG CCTGCTGCAG GAGATGGGTC TCTGA
 
Protein sequence
MIFLLVAGFP DSLLSFRGPL LEALLARGLE VHVAAPHLAP GCLLRQRLEA RGLRVHDIPL 
RRTGMNPLQD CATLLHLWRL KRRIRPTHVL GYTAKPVIYG SLAAAWAGVP RRFALITGLG
YAFLGEAGDG GARGLLHALL PRLYALALRR THKVFFQNPD DQALFRGQGI LGPATPSCVI
NGSGVDLLEY PVAPVPARPH FLLIARLLGD KGVREYAAAA RQVKNRCPAA LFSLVGWIDD
NPDAIGQAEL DGWLADGTLH YLGRLDDVRP AIAACSVYVL PSYREGTPRT VLEAMAMGRA
VITTDAPGCR ETVVDGDNGF RVPVKAVDEL ARAMQRFVEE PALAVRMGAR SRQLAEEKYD
VQRINARLLQ EMGL