Gene Information |
Locus tag | Avin_05370 |
Symbol | |
ID | 7759493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 517314 |
End bp | 518903 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643803457 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_002797765 |
Protein GI | 226942692 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
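The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length; neither convention is stated in the record itself, though both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 517314
END_BP = 518903


def gene_length_bp(start, end):
    """Feature length, assuming inclusive 1-based coordinates."""
    return end - start + 1


def protein_length_aa(length_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    codons, remainder = divmod(length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints "1590 529"
```

Under these assumptions the result agrees with the Gene Length (1590 bp) and Protein Length (529 aa) fields.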
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.138893 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGCCC TCAGGCACGA GAACTCCCCG CACGCCTTCT TCTCCGCGCT GTTCGCCAAC CGCCGGCTGG TCAAGCGGGT ATTCCTGGCC TTCGCCGCGC TCGCCCTGCT GCTGCCGCTG CTGCTCGGCC GCTCGTACGA GATCGGCGCG GAGGTCATGG TACAGTCGAA GAAGGTGGCC CAGACCGAAC CCAACAGCGC CACCCTGCAG CAGGAGACCG ACAAGTTCCT GCCGCCGACC CTGGCCGACA TGGAAACCGA GAGCAGCATC CTGCGCTCGC CCGAACTGGT CCGGGCGACC CTCGAGTCGC TGATCCGCGA GGGCCACTTC GCCGAGGACA AGGGCCTGGC GGGCCGGCTG CGCGACTACC TGGCCGTGCC CCTGCGCGAG CATGTGCTCG ACCCGCTGCG CACCGCCCTG GGCCTGGCCG CCGACGCGCC GCGCGACAAC CGCCTGGACG AACTGACCCT GGCCATCCTG GAAGACCTCG AAATCATCCC GCTGCCAGGC TCCAACGTCA TCGCCGTGCA TTACCGCTCC GGCGACCCGG CCCTCGGCAC CCTGTTCGTC AACCGCCTGC TCGACACCTA CCTGGTCCGC CGCCATGCGC TGCACTCCAA CGAACTGCCG GAAGCCTTCT ACGAACAGAA GAAGACCCAG TACCAGGACC AGTTGAACGG CCTCGAAGCG CAACGGCTGG CCCTGCTCGA ACGGATTCGC GCGGCCAACC CCGAGGAGGA AATCACCTTC CGCCTCAACG CCATCAACCA GGAAGAACAG GCCCTCAACC AGTACCGCGA CCGCCTGCTG GAGAACCAGC GCTGGGTCGA CTACCTGCAG GGCAACCTGG CCGTGGCGCG CAAGGCCAGG CTGACCGACT ACGGCTTTCC CTACACCTTC GCCAACACCC TCGACAACGC CGCCTTCGAG GACCGCGAGA TCCGCCAACT GGGCGACCGG CTGATCGAGC AGATCGGCCG CTACGGCGCC GAGACCGACA TCTACAACCC GAACAGCGAG CCGATGAAGA ACCTCCATGC GCAGATCAGC CGGACCCGCC AGCAATTCCT CCAGGTGATC GGCAACCGCA TCAGCGAACG GCGCAAGGAA CTGGAGATCA TCTCCGGGGT GATCGCCCAG AAGACCACGC GCATCGAGGA ATACCAGGCG CGCATCCGCG AGCTGCAGGA CGCGCAGAGC GGCCTGCGCC AGCTCAACAC CGAGATCGAG GCCCTGCACC AGGCCTTCTT CACCTACACC CAGCGCTACG AGGAAAGCCG CAGCCGCGCC CTGCTCGACG GCGGCCTGTC CAACGCCAAG GTGCTCAGCC GCCCCTTCGA GCCCAGCGAG GCGAGCTTTC CGCGGCCGCT GCAGATCATC CCTCTCGGCC TGCTCACCGC CCTGCTGCTG GCCGTCGCCG CGGGATGCCT GCGCGAATTC TTCGATCGCC GCTTCAAGTA TCCCGGGCAG TTGCAGAGCC AGCTCGGCCT GCCCGTGCTG ATGACCCTCA ACGCCGAGCA GCCGGCCGCG CTGCCCAATC CGCACAAACC CGGGAGCCTG CCATGGATTC GACACTGGGC GAGCGACTGA
|
Protein sequence | MDALRHENSP HAFFSALFAN RRLVKRVFLA FAALALLLPL LLGRSYEIGA EVMVQSKKVA QTEPNSATLQ QETDKFLPPT LADMETESSI LRSPELVRAT LESLIREGHF AEDKGLAGRL RDYLAVPLRE HVLDPLRTAL GLAADAPRDN RLDELTLAIL EDLEIIPLPG SNVIAVHYRS GDPALGTLFV NRLLDTYLVR RHALHSNELP EAFYEQKKTQ YQDQLNGLEA QRLALLERIR AANPEEEITF RLNAINQEEQ ALNQYRDRLL ENQRWVDYLQ GNLAVARKAR LTDYGFPYTF ANTLDNAAFE DREIRQLGDR LIEQIGRYGA ETDIYNPNSE PMKNLHAQIS RTRQQFLQVI GNRISERRKE LEIISGVIAQ KTTRIEEYQA RIRELQDAQS GLRQLNTEIE ALHQAFFTYT QRYEESRSRA LLDGGLSNAK VLSRPFEPSE ASFPRPLQII PLGLLTALLL AVAAGCLREF FDRRFKYPGQ LQSQLGLPVL MTLNAEQPAA LPNPHKPGSL PWIRHWASD
|
| |
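The gene and protein sequences above can be compared by translating the nucleotide sequence. The following is a minimal sketch rather than the database's own method: it assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG even though the record lists the minus strand), and it uses the standard codon assignments, which translation table 11 shares for elongation codons. The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and only the first and last 30 bases of the record are embedded here to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First and last 30 bases of the gene sequence, as formatted in the record.
FIRST_30 = "ATGGACGCCC TCAGGCACGA GAACTCCCCG"
LAST_30 = "CCATGGATTC GACACTGGGC GAGCGACTGA"


def clean(seq):
    """Remove the whitespace that separates the 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding-strand sequence codon by codon ('*' marks a stop)."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


print(translate(clean(FIRST_30)))  # MDALRHENSP
print(translate(clean(LAST_30)))   # PWIRHWASD*
```

The two fragments translate to the first and last residues of the listed protein sequence, followed by a stop codon. Passing the full cleaned gene sequence to `gc_fraction` would give the value to compare against the GC content field.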