Gene Avin_29990 details


Gene Information

Locus tag: Avin_29990
Symbol:
ID: 7761900
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 3105023
End bp: 3106159
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 72%
IMG OID: 643805872
Product: Glycosyl transferase, group 1 family protein
Protein accession: YP_002800140
Protein GI: 226945067
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
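
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp, includes_stop=True):
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon does not encode an amino acid.
    return codons - 1 if includes_stop else codons


length = gene_length_bp(3105023, 3106159)
print(length)                     # 1137
print(protein_length_aa(length))  # 378
```

Both values agree with the Gene Length and Protein Length fields of this record.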


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGCCT CGCCCCGATC GGTCGTCCAC CTGCTCGCCT CGCTGGACTT CGGCGGCGTG 
GAACGGCGCA TGGAACTGCT GGCCGAGCAG CCCGCCGGCG ACATACGGCA TCTGTTCTGC
GCCATCGGCG GCGGCGGCAA TGCCGAACGC CGCCTGCAGA GCCTCGGCGC TCCGGTCCGC
TGCCTGCACC AGCCGACGGC GATCCCCAGT CCGGCCGCGA TCCTCGCGCT CGTCCGCCTG
CTCCGCCGGC TGCGCCCGAC GGTGCTGCAC GCCCACGGCG CCGAGGCTAA CTTCCACGGT
CTGATCGCCG CCCGGCTGGC CGGGGTGCCG GTGCGGATCG CCGAGGAGAT CGGCATCCCG
ACGCACAGCG CGCGGGCCCG CCGGGTGTTC CGCCAGCTCT ACCGCAGCGC CCACTGCGTC
GTCGGCATCT CCGACGCGGT GACCGGCTGG CTGGTCGACA GCGGCGAAGT GCCGCCGGAC
AAGGCGATCC GCATCTACAA CCCGGTCAAG CTGCCGGACC GGCACGACCG GCAGGCAGCG
CCGGAGGACG GGCTGCGCAT CGCCTTCGTC GGCCGTCTCG AAGCGGTCAA GAACCCCCTG
GCGCTGGTCG AGGCCGCCGC CCTGCTGCTG GCCCGCGGGA TTCCCGTGGA ACTCTGGCTG
ATCGGCGAGG GCCGCGAGCG GCAGCGCCTG GAAGCCATGG TCCGCGCCCG GGGACTGGAC
AGGCGCGTGC ATCTGCCGGG CTACCGGGCG CATCCCGAGG CGTACGTGCG CCGCTGCCAC
CTCTATGTCC AGCCCTCGCG CTCCGAAGGC TTCGGCCTGG CGCTGGTCGA GGCCATGGGC
TGCGGCCTTC CGGTCGTCGC CACGGCGGTG GGCGGCGCGC CGGAGATCGT CGAGTCCGGC
GTCACCGGCT GGCTGCTGCC GGAAGCGACG CCGGCCGCCC TCGCCGATGT CCTCGAAGCG
GCCTGGCGGC TCGGCCCGCG ACGGCTGGAA AGCATGGGCG AACGGGCCCG CGGCGCCGTC
GAGGGACGTT TCGAACCAGC CCGCTACAAG GCCCGGCTGG AAACCCTGTA CCGACGATTC
ACCCCGCGAA AGGCCAAAGG CGAGCATGGA AAAGATTCGG ATTCTGCACT GTCTTGA
 
Protein sequence
MSASPRSVVH LLASLDFGGV ERRMELLAEQ PAGDIRHLFC AIGGGGNAER RLQSLGAPVR 
CLHQPTAIPS PAAILALVRL LRRLRPTVLH AHGAEANFHG LIAARLAGVP VRIAEEIGIP
THSARARRVF RQLYRSAHCV VGISDAVTGW LVDSGEVPPD KAIRIYNPVK LPDRHDRQAA
PEDGLRIAFV GRLEAVKNPL ALVEAAALLL ARGIPVELWL IGEGRERQRL EAMVRARGLD
RRVHLPGYRA HPEAYVRRCH LYVQPSRSEG FGLALVEAMG CGLPVVATAV GGAPEIVESG
VTGWLLPEAT PAALADVLEA AWRLGPRRLE SMGERARGAV EGRFEPARYK ARLETLYRRF
TPRKAKGEHG KDSDSALS
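
The sequences above are printed in space-separated blocks of 10 residues, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction and check the final codon. It assumes the standard stop codons of translation table 11 (TAA, TAG, TGA); the helper names are hypothetical, and the short strings used in the example are the first two and last two blocks of the gene sequence, not the full record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw):
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq):
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


head = clean_sequence("ATGAGCGCCT CGCCCCGATC")  # first two blocks of the gene
tail = clean_sequence("ATTCTGCACT GTCTTGA")     # last two blocks of the gene
print(len(head), gc_fraction(head))
print(ends_with_stop(tail))
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` would give a value to compare against the GC content field.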