Gene Avin_05370 details

Gene Information

Locus tag: Avin_05370
Symbol:
ID: 7759493
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 517314
End bp: 518903
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 67%
IMG OID: 643803457
Product: lipopolysaccharide biosynthesis protein
Protein accession: YP_002797765
Protein GI: 226942692
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID:
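
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable and function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 517314
end_bp = 518903
gene_length_bp = 1590
protein_length_aa = 529

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons; the final codon is the stop
# and does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)


def record_is_consistent() -> bool:
    """Return True if coordinates, gene length and protein length agree."""
    return (
        span == gene_length_bp
        and remainder == 0
        and codons - 1 == protein_length_aa
    )


print(span, codons, record_is_consistent())
```

Under these assumptions the three fields agree: the span equals the stated gene length, and the codon count minus the stop codon equals the stated protein length.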


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.138893
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACGCCC TCAGGCACGA GAACTCCCCG CACGCCTTCT TCTCCGCGCT GTTCGCCAAC 
CGCCGGCTGG TCAAGCGGGT ATTCCTGGCC TTCGCCGCGC TCGCCCTGCT GCTGCCGCTG
CTGCTCGGCC GCTCGTACGA GATCGGCGCG GAGGTCATGG TACAGTCGAA GAAGGTGGCC
CAGACCGAAC CCAACAGCGC CACCCTGCAG CAGGAGACCG ACAAGTTCCT GCCGCCGACC
CTGGCCGACA TGGAAACCGA GAGCAGCATC CTGCGCTCGC CCGAACTGGT CCGGGCGACC
CTCGAGTCGC TGATCCGCGA GGGCCACTTC GCCGAGGACA AGGGCCTGGC GGGCCGGCTG
CGCGACTACC TGGCCGTGCC CCTGCGCGAG CATGTGCTCG ACCCGCTGCG CACCGCCCTG
GGCCTGGCCG CCGACGCGCC GCGCGACAAC CGCCTGGACG AACTGACCCT GGCCATCCTG
GAAGACCTCG AAATCATCCC GCTGCCAGGC TCCAACGTCA TCGCCGTGCA TTACCGCTCC
GGCGACCCGG CCCTCGGCAC CCTGTTCGTC AACCGCCTGC TCGACACCTA CCTGGTCCGC
CGCCATGCGC TGCACTCCAA CGAACTGCCG GAAGCCTTCT ACGAACAGAA GAAGACCCAG
TACCAGGACC AGTTGAACGG CCTCGAAGCG CAACGGCTGG CCCTGCTCGA ACGGATTCGC
GCGGCCAACC CCGAGGAGGA AATCACCTTC CGCCTCAACG CCATCAACCA GGAAGAACAG
GCCCTCAACC AGTACCGCGA CCGCCTGCTG GAGAACCAGC GCTGGGTCGA CTACCTGCAG
GGCAACCTGG CCGTGGCGCG CAAGGCCAGG CTGACCGACT ACGGCTTTCC CTACACCTTC
GCCAACACCC TCGACAACGC CGCCTTCGAG GACCGCGAGA TCCGCCAACT GGGCGACCGG
CTGATCGAGC AGATCGGCCG CTACGGCGCC GAGACCGACA TCTACAACCC GAACAGCGAG
CCGATGAAGA ACCTCCATGC GCAGATCAGC CGGACCCGCC AGCAATTCCT CCAGGTGATC
GGCAACCGCA TCAGCGAACG GCGCAAGGAA CTGGAGATCA TCTCCGGGGT GATCGCCCAG
AAGACCACGC GCATCGAGGA ATACCAGGCG CGCATCCGCG AGCTGCAGGA CGCGCAGAGC
GGCCTGCGCC AGCTCAACAC CGAGATCGAG GCCCTGCACC AGGCCTTCTT CACCTACACC
CAGCGCTACG AGGAAAGCCG CAGCCGCGCC CTGCTCGACG GCGGCCTGTC CAACGCCAAG
GTGCTCAGCC GCCCCTTCGA GCCCAGCGAG GCGAGCTTTC CGCGGCCGCT GCAGATCATC
CCTCTCGGCC TGCTCACCGC CCTGCTGCTG GCCGTCGCCG CGGGATGCCT GCGCGAATTC
TTCGATCGCC GCTTCAAGTA TCCCGGGCAG TTGCAGAGCC AGCTCGGCCT GCCCGTGCTG
ATGACCCTCA ACGCCGAGCA GCCGGCCGCG CTGCCCAATC CGCACAAACC CGGGAGCCTG
CCATGGATTC GACACTGGGC GAGCGACTGA
 
Protein sequence
MDALRHENSP HAFFSALFAN RRLVKRVFLA FAALALLLPL LLGRSYEIGA EVMVQSKKVA 
QTEPNSATLQ QETDKFLPPT LADMETESSI LRSPELVRAT LESLIREGHF AEDKGLAGRL
RDYLAVPLRE HVLDPLRTAL GLAADAPRDN RLDELTLAIL EDLEIIPLPG SNVIAVHYRS
GDPALGTLFV NRLLDTYLVR RHALHSNELP EAFYEQKKTQ YQDQLNGLEA QRLALLERIR
AANPEEEITF RLNAINQEEQ ALNQYRDRLL ENQRWVDYLQ GNLAVARKAR LTDYGFPYTF
ANTLDNAAFE DREIRQLGDR LIEQIGRYGA ETDIYNPNSE PMKNLHAQIS RTRQQFLQVI
GNRISERRKE LEIISGVIAQ KTTRIEEYQA RIRELQDAQS GLRQLNTEIE ALHQAFFTYT
QRYEESRSRA LLDGGLSNAK VLSRPFEPSE ASFPRPLQII PLGLLTALLL AVAAGCLREF
FDRRFKYPGQ LQSQLGLPVL MTLNAEQPAA LPNPHKPGSL PWIRHWASD
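
The sequences above are printed in space-separated groups of ten residues, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction and a basic coding-sequence sanity check. The helper names (`clean_sequence`, `gc_fraction`, `looks_like_cds`) are hypothetical, and the example fragment is only the first and last lines of the gene sequence joined together for illustration, not a contiguous gene.

```python
# Stop codons under translation table 11 (the table listed for this gene).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Basic check: length is a multiple of three and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGACGCCC TCAGGCACGA GAACTCCCCG CACGCCTTCT TCTCCGCGCT GTTCGCCAAC"
last_line = "CCATGGATTC GACACTGGGC GAGCGACTGA"

fragment = clean_sequence(first_line + "\n" + last_line)
print(len(fragment), looks_like_cds(fragment), round(gc_fraction(fragment), 2))
```

Passing the full gene sequence block through `clean_sequence` and `gc_fraction` would give a value to compare with the GC content field of the record.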