Gene Avin_10950 details


Gene Information

Locus tag: Avin_10950
Symbol: alg44
ID: 7760039
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 1046532
End bp: 1047698
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 66%
IMG OID: 643803999
Product: alginate biosynthesis protein Alg44
Protein accession: YP_002798301
Protein GI: 226943228
COG category:
COG ID:
TIGRFAM ID:
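
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1046532
end_bp = 1047698

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result agrees with the reported 1167 bp and 388 aa.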


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00119266
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATACCG CGACCCTTAA TGTAAACGTG GTCCACGAGT CGGAAGCGCA ACGCCAACAC 
GCGCGCGTCA AGCTGCCCGG TAAGATCCGC TTCCTCGGCC CCAACCGGGA AACCATCGAG
CAGCGTCTGA TCGACATCTC CGCCGGTGGC TTCAGCTTCG CCAGCGGCAA GCCGGTCACC
CAGCAGGGTG CCTTCCACCG CGGCAAGCTG CTGTTCCAGC TGGACAGCCT GGGCCTGGCC
ATGGACGTGG AGTTCCAGGT GCGCAACCTC GACCCGGAAA GCGGCCGCAC CGGCTGCCAG
TTCCACGGTC TGGGCGCACG CGAGATCTCC ACCCTGCGCC AGATGATCAC CTCGCACCTG
AGCGGCGAGT TGGTCACCGT CGGCGATGTG ATCTGCACCC TGCAGCGCGA CAACTTCACC
AAGGCGCGCA AGGGCAAGGG CCTGGCCCAG CAGACCATGT TCGAACGCTT GCGCGCGGTC
AGCTTCAGCC TGGCGATCTT CATCGTCGGC CTCGGCGCCT TCGGCCTGAT CCTCAAGCAG
CTCTACGACC TCTACTTCGT CACCCATGCC GAGTCGGGCA TGGTCAGCGT GCCCAGCATG
GAAGTGACCA TGCCCCGCGA AGGCACGGTG CAGAGCCTGG TCGGCCCGGA CGGCCTGGTC
GCCAACGGTG CGCCGATCGC CAGCTTCTCC GCCTCCATGC TGGAAATGCT CAAGGGCCAC
CTGAGCGAGG AGCAGCTCAA CCCGGCCAAC GTCGAGAAGC TGTTCACCCG GCAGATGAAG
GGCACCCTGA CCAGCCCGTG CGATTGCAAG GTGGTCGCCC AGCGCGTGGC CGACGGCCAG
TTCGCCTCCA AGGGCCAGGT GATCTTCGAG CTGCTGCCGC GCGACGCCGC CGCTACCGTC
GAGGCCCGTT TCCGCTACCA CGACTTCGCC AAGGTCAAGC CGGGCACCCA GGTCACTTTC
AGCGTTCCCG GCGAGGACCA GCCGCGCCGC GGCCGGATCG TCAGCACCGC CCTGCAGAAT
GAAGGACTGT CCAGCGATAT CCGCGTGCTC ATCCAGCCCG AACAGCCCCT GGACAGCGCC
CTGGCCGGCC AGCCGGTGGA AGTGGTCATC GACCACGGCC CTTCCTACGA CTGGCTGATC
GACAAGGCCG TGACCGCCGG ACTCTGA
 
Protein sequence
MNTATLNVNV VHESEAQRQH ARVKLPGKIR FLGPNRETIE QRLIDISAGG FSFASGKPVT 
QQGAFHRGKL LFQLDSLGLA MDVEFQVRNL DPESGRTGCQ FHGLGAREIS TLRQMITSHL
SGELVTVGDV ICTLQRDNFT KARKGKGLAQ QTMFERLRAV SFSLAIFIVG LGAFGLILKQ
LYDLYFVTHA ESGMVSVPSM EVTMPREGTV QSLVGPDGLV ANGAPIASFS ASMLEMLKGH
LSEEQLNPAN VEKLFTRQMK GTLTSPCDCK VVAQRVADGQ FASKGQVIFE LLPRDAAATV
EARFRYHDFA KVKPGTQVTF SVPGEDQPRR GRIVSTALQN EGLSSDIRVL IQPEQPLDSA
LAGQPVEVVI DHGPSYDWLI DKAVTAGL
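
Both sequences are printed in blocks of 10 characters, 60 per line, so they need to be joined before use. The following is a minimal sketch of that cleanup, with a spot check that the first codons of the gene sequence translate to the first residues of the protein sequence. The helper names (`clean_sequence`, `gc_fraction`, `translate_prefix`) are hypothetical, and the codon table is deliberately partial: it holds only the five codons needed for this check, not the full translation table 11.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# Partial codon table: only the codons needed for the spot check below.
CODONS = {"ATG": "M", "AAT": "N", "ACC": "T", "GCG": "A", "CTT": "L"}


def translate_prefix(dna: str, n_codons: int) -> str:
    """Translate the first n_codons codons using the partial table."""
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, 3 * n_codons, 3))


# First line of the gene sequence and of the protein sequence, as printed.
gene_line = "ATGAATACCG CGACCCTTAA TGTAAACGTG GTCCACGAGT CGGAAGCGCA ACGCCAACAC"
protein_line = "MNTATLNVNV VHESEAQRQH ARVKLPGKIR FLGPNRETIE QRLIDISAGG FSFASGKPVT"

dna = clean_sequence(gene_line)
protein = clean_sequence(protein_line)

prefix = translate_prefix(dna, 6)
print(len(dna), len(protein), prefix, protein.startswith(prefix))
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` would allow a comparison with the reported GC content. A complete translation would need a full codon table, for example from a library such as Biopython.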