Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_50840 |
Symbol | |
ID | 7763932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 5152365 |
End bp | 5154158 |
Gene Length | 1794 bp |
Protein Length | 597 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643807912 |
Product | hypothetical protein |
Protein accession | YP_002802146 |
Protein GI | 226947073 |
COG category | [S] Function unknown |
COG ID | [COG3519] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03359] type VI secretion protein, VC_A0110 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.232659 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACT CGATTGACGA GGTCCTGCTC GACTACTACC AGCGCGAACT GACCTGGCTG CGGCATGCCG GCGCCATTTT CGCCGAGCGC TATCCCAAGG TCGCCCGACG CCTGGAGCTG TCTCCCGGCG AGTGCCCGGA CCCCCATGTG GAACGCCTGC TGGAAGGCTT CTCCCTGCTC GCCGCGCGCC TGCAGCGGCG CCTCGACGAC GACTACGCCG AGTTCAGCGA CGCCCTGCTG GAACAGTTGT ATCCGCTGGC CCTGCGCCCT TTGCCCTCCT GCGCCATCGT CCAGTTCGAG CCGGACCCGA CCAAGGGCAA TCTCGCCGAG GGTTATCGCT TGCCGCGCGA TACCCCGCTG TTCGTCACCG GCGCCGACGG CGCCAGCGTG CACTTCCGCA CCAGCGCCGA GGTCGAGCTG TGGCCGCTGC AGATCGTCGA GGCGACCCTC CTCGCCGGCG ACGAAGCCTG CGCCCTGACC GGCGTGGCGC CGGCCCGTTC GGCCCTGCGC CTGAGCCTGC GCTGCCTGGG CGGGTGCCGC TGGCCGGAGC TGCCCGTGCG TCGCCTGCGC CTGCATCTGG CTGCGTCACC GATGGTCAAC GCCAGCCTCC ACGACCTGCT GGGCGCCCAT GCCCTGCAGA TGCTCGCCGG AGTGCCGGGC AGCCTGCCGC AGGCTTTGCC CGGGTTGCCG CAGGCGGTCG GCTTTTCCGC CGCCGAAGCC TTGCTGCCGG ACGAGGACGG ACTGCACCCC GGTCTGCGTC TTCTGGCCGA ATACTTCGCC TTCCCGGACA AGTTCGCCTT CTTCGACCTG CCCGTGCAGG CGCCGTCCGG CGCGAGCGAA GAACTCCAGT TGTACATCGT CTTCGACCGC GCCCCGGCCG GCCGGCTACA TCTGCAGGCG GCCGACTTCG CTCTCGGCTG CGCGCCGCTG GTCAACCTGT TCCCGCGCAC CTCGGAGCCG CTGCGCCCGG ACGGCACCCG CAGCGAATAC CGGCTGGTCG CCGACAGCCA CCGGGAAAAC AGCATGGAGA TCCACAGCAT CCGCGCGCTG CGCGCCTGCT CGGCAGAGGG CGTGCGCCAG GTGCCGGCCT ACCATGGCTG CCAGCACGCC CTCGGCGAGA GCCGCCTCTA CTGGCACGCC CGCCGCGTCG ACGGGCTGAC GCCGAACCGC CTGGGCAGCG ACCTGCTGCT GAGCCTGGTC GACACCCGTT TCGACCCGCA GCGCGAGGCG CCCGACTACA GCTTGACCGC CGAGCTGCTG TGCACCAGCC GACACCTCGC CGAGGCTCTG GGCGCCGGTA CCCGGCTGGA TTTCGAGCGC CCCGGCCCGG TGGCCCGCGC CCGCCTGCGC AATCCGCCGA CGCCGCAAAG CCTGCCGCGC CTGCGCGGCG AATCGCGCTG GCGGCTGGTC TCGCAACTGA GCCTCAACCA CCTGTCGCTG GTGGAGGGAG AACGGGCCCT GGACGCACTC AAGGAAATGC TCCAGTTGCA CAACCTGCGC GACGAGCCCG GCGCGCGCCG GCAGATCGAC GGCCTGGTTC GCCTCGACTG CGAACGCATC GTCGCCCAAG TCGGCGAGGA CGCCTGGCGC GGCTGGCGCA ACGGCCTGGG CGTGCGCCTG CATCTCGACC CGCAGCATTT CGTCGGCAGC AGCGCCGTGC TGTTCTCCGC CGTCCTCGCG CAGTTCTTCT CGCTCTACGC CAGCGCCAAT CGCTTCGTGC GCACCACGCT GGTCCTGGCC GACAAGGAGA TCAAGACGTG GCAGCCACAG GCCGGAATGC CGCTCGTCCT CTGA
|
Protein sequence | MSDSIDEVLL DYYQRELTWL RHAGAIFAER YPKVARRLEL SPGECPDPHV ERLLEGFSLL AARLQRRLDD DYAEFSDALL EQLYPLALRP LPSCAIVQFE PDPTKGNLAE GYRLPRDTPL FVTGADGASV HFRTSAEVEL WPLQIVEATL LAGDEACALT GVAPARSALR LSLRCLGGCR WPELPVRRLR LHLAASPMVN ASLHDLLGAH ALQMLAGVPG SLPQALPGLP QAVGFSAAEA LLPDEDGLHP GLRLLAEYFA FPDKFAFFDL PVQAPSGASE ELQLYIVFDR APAGRLHLQA ADFALGCAPL VNLFPRTSEP LRPDGTRSEY RLVADSHREN SMEIHSIRAL RACSAEGVRQ VPAYHGCQHA LGESRLYWHA RRVDGLTPNR LGSDLLLSLV DTRFDPQREA PDYSLTAELL CTSRHLAEAL GAGTRLDFER PGPVARARLR NPPTPQSLPR LRGESRWRLV SQLSLNHLSL VEGERALDAL KEMLQLHNLR DEPGARRQID GLVRLDCERI VAQVGEDAWR GWRNGLGVRL HLDPQHFVGS SAVLFSAVLA QFFSLYASAN RFVRTTLVLA DKEIKTWQPQ AGMPLVL
|
| |