Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_03750 |
Symbol | |
ID | 7759335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 354925 |
End bp | 355929 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643803299 |
Product | extra-cytoplasmic solute receptor, Bug family |
Protein accession | YP_002797610 |
Protein GI | 226942537 |
COG category | [S] Function unknown |
COG ID | [COG3181] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.549978 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACATT GCTCCCGACG GCGTTTTCTC ACTCTGGCGG GCGGCGCGCT GCTCGCCGCC CCCCTGGCCG GCCGCCTCGC CCTGGCGGCG CCGGGCGACT GGCCGCAGCG CCCGATCAGG TACGTCGTAC CCTGGCCGGC GGGCGGCCCG ACCGATACCT TCGGGCGGGT GATCGCCAAC GAGCTCGCCA CCCTGCTCGG CCAGCCGGTG GTGGTCGAGA ATCGTACCGG CGCGACCGGG GCCATCGGCG TCCGGCATGT CGCCCGCAGC GAGCCCGACG GCTACACGCT GCTGGCGCCG AACACCACCT CGCTGATCGG CAACGTCGTG GCGACGCCCG AAGCCGTCGA CTTCGACCCG TTGAAGGATT TCACCCCGAT CGGCCTGTTC GTCGACTCCT CGGTGGTGCT CTGGGCGCAG GCCTCGACCG GCATCGCGAA CTTCGCGGCC CTGCGCGAGC GCGCCCGCGA CGCGGAGCGT CCGCTCTCCT TCGGCACCAC GGGCGGCGGC TCGGTTTCGG AACTGTCGGT GGAACAGCTC GCCCGCCATT TCGGGCTGAA CCTGCTGAAA GTGCCATACA AGGGCACCGC ACCCCAGGTC GCCGACCTGG TCGCCGGGCA TATCGACATC GGCGTGGCCG ACTACCCGGT CGCCGCCGGG CATTTCGCCA GCGGCAAGCT GGTCCCCCTG CTGGTCATCG GCCGCCAGCG CCTGCCGGAA CTGCCGGAGG TGCCGACCAA CTTCGAGCTG GGTATCGAGG AGCCCGACTT CACGATCTGG AACGGCCTGT TCGCGCCGGC CGCGACACCG GCCCCGATCG TCGCCCGGCT GCGCGAAGCC CTGGCCGTCG CCGCCCGCAG CGAGGCCTTC CGCAAGGTCG CCGAGGGCCA GGGCAACCGG CCGATCTTCC AGACCGGCGA GGAAGCCAGC GCCCGCCTGC GCCGGGAGCT GGACAGCCGG CGGAAATTCA AGGAACAGAT CGAACGAGGC GTCCCGGCGG CCTGA
|
Protein sequence | MTHCSRRRFL TLAGGALLAA PLAGRLALAA PGDWPQRPIR YVVPWPAGGP TDTFGRVIAN ELATLLGQPV VVENRTGATG AIGVRHVARS EPDGYTLLAP NTTSLIGNVV ATPEAVDFDP LKDFTPIGLF VDSSVVLWAQ ASTGIANFAA LRERARDAER PLSFGTTGGG SVSELSVEQL ARHFGLNLLK VPYKGTAPQV ADLVAGHIDI GVADYPVAAG HFASGKLVPL LVIGRQRLPE LPEVPTNFEL GIEEPDFTIW NGLFAPAATP APIVARLREA LAVAARSEAF RKVAEGQGNR PIFQTGEEAS ARLRRELDSR RKFKEQIERG VPAA
|
| |