Gene Information |
Locus tag | Avin_50780 |
Symbol | |
ID | 7763926 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 5147205 |
End bp | 5148530 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643807906 |
Product | hypothetical protein |
Protein accession | YP_002802140 |
Protein GI | 226947067 |
COG category | [S] Function unknown |
COG ID | [COG3522] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03353] type VI secretion protein, VC_A0114 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.357651 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGTTC TACCCGAGGC GGTCTGCTGG CACGAGGGCA TGCAGTTGCT GCCCCAGCAT TTCCAGCTCC AGGGCTTGCG CGCGGAAGCC CTGGCGGGAC ACCTGGCCAG GGTCTGCAAT CCCTGGTTCT GGGGCGTCGA GACGCTGGAG GTCGATCCTG CCGCCCTCTG TGCCGGTCGG GTGCGGGTGA CCGCCCTCGA AGCCACCCTG CCGGACGGCC TGCCGGTCGG CCTGAGCGCC GACGGCGGTG CGCCGCTGGA ACTGGATCTG GCCGAGGCGT TCGCCGACTC GGCCGCCGGC GTGACCGTCT ACCTCGCCGT GACGCCGCTG GGACGGGCCG GCCAGTTGCT GCCGCTGAAC GGACGCCTGC GCTCGGTGGT CGGTGACGCG CTGCCCGATC TGGCCAGCGG CGAGCATCCC GAGCCGATCG TCGTCTGGCG GCCGAATCCG CGCCTGGTCA CCGCCGAAGG ACGGGCCGAC TCGGTGTGCA TTCCCCTGCT GCGAGTGGGG CGGGAGGGCG GCGGCTATGT TCGTTTGCCC TATGTCGCCC CGATGCCGCG CATCCTCCCG GAGTCGCCGC TGGGCGAAAA GGTGCGGGCC CTGTGCGCCC GCGCCCGGGA GAAGTGCCTG TTCCTCGCCG GGCGGCTGCG CCAGGCCCGG CAGGCCGGCA ACGCCGAAGA CGCCGCCGAG CTTCGCCTGC AACTGGCGGC CCTGTGGGCG CGACTGCCCG AGGTGGAGGC GGCCTTGAAC GGACGCGGCA CCGATCCGGC GACCCTGCAC CGGCTGCTGG CCGGCATGGC CGGCGCCTGG TGCGCGCTCG ATCCGTTGGC CGGCGTCCCG GCGTTCCGTC CCCTGGAGTT CGAGGACCTG CTGCGCGGCT ACGACGAGGT GCTCGGCTGG CTGGCGGCGA CGCTGGCGCG GATCCGCGCG GGCTACCGGA GCCTGCCGTT CGAGCAGGAC CGGCATGGCT TCCATATCCT CCTGCCGGAT CGGGAGCGGC CCGGCCAGCG CCTGGCGATC GGCCTGCGCA TGCCGGCCGG GACCGGCGAG CAGGCGGCCC GGGACTGGCT GGCGCAAGCC ATCGTCGCTT CCGAGGCGCA TGTCGCGACT CTGGTTCGCC AGCGCATGGG CGGGCTTTCC TGGCATCCCA TGCCCCGGCA GGAACAGGTC GCCTACGGGG TCGGCGAGGA CACGCGGATC TTCGTGCTCA AGGCGGCCGG CGAGTGGTTC GATCCGAGCC AGGCGCTGCG CATCGTGCCC TGCGGCGGCG CGGCCGGCGG CCAGCCCTGG CAGATCGTGC TGTTCGCCGA CGCCGGCGAC CGCTAG
|
Protein sequence | MSVLPEAVCW HEGMQLLPQH FQLQGLRAEA LAGHLARVCN PWFWGVETLE VDPAALCAGR VRVTALEATL PDGLPVGLSA DGGAPLELDL AEAFADSAAG VTVYLAVTPL GRAGQLLPLN GRLRSVVGDA LPDLASGEHP EPIVVWRPNP RLVTAEGRAD SVCIPLLRVG REGGGYVRLP YVAPMPRILP ESPLGEKVRA LCARAREKCL FLAGRLRQAR QAGNAEDAAE LRLQLAALWA RLPEVEAALN GRGTDPATLH RLLAGMAGAW CALDPLAGVP AFRPLEFEDL LRGYDEVLGW LAATLARIRA GYRSLPFEQD RHGFHILLPD RERPGQRLAI GLRMPAGTGE QAARDWLAQA IVASEAHVAT LVRQRMGGLS WHPMPRQEQV AYGVGEDTRI FVLKAAGEWF DPSQALRIVP CGGAAGGQPW QIVLFADAGD R
|
| |
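The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence above ends in TAG). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the record above.
START_BP = 5147205
END_BP = 5148530


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1326 441
```

Passing the gene sequence from the record to `gc_percent` should give a value that rounds to the reported GC content, provided the page computes it the same way (G plus C over all bases).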