Gene Information |
Locus tag | Avin_18850 |
Symbol | |
ID | 7760819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 1869812 |
End bp | 1872442 |
Gene Length | 2631 bp |
Protein Length | 876 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643804783 |
Product | hypothetical protein |
Protein accession | YP_002799072 |
Protein GI | 226943999 |
COG category | [S] Function unknown |
COG ID | [COG2898] Uncharacterized conserved protein |
TIGRFAM ID | |
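
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes a terminal stop codon; the record's own values are consistent with both assumptions, but neither is stated on the page. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS, assuming it ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon, which encodes no residue


length = gene_length(1869812, 1872442)
print(length)                  # 2631, matching "Gene Length"
print(protein_length(length))  # 876, matching "Protein Length"
```

Because the gene is reported as unspliced, the whole span between the two coordinates is treated as coding sequence here.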
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.11879 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATACCC TGAGCAAACG ATCGGCCGTC TCTGTCTGCA TCAAGGCGCT GCAAGCCCTC GCGCACCGGC TCCGCCCCCT CGGCGAATAC CGCACCGTCC TGGCCTCGCT GTTCACCCTG CTGGTCTTCG GCCTGGCGCT GATCGCCTGT CGCCGCCTGC TCGACAGCCT GAATCCCCAG CATCTGCACG AAGCCCTGCT CGCCATACCG GGCACCGACC TGGCGCTGGC CCTGCTGGCC ACGGCGGCCG GCTTCGTCGC CCTGCTCGGC TACGAATGGT CGGCCTGCCG CTACGCCGGC GTGCGTCTGC CGCCTCCCAG CCTGCTGCTC GGCGGCTTCT GCGCCTTCGC CATCGGCAAC GCCACCGGCC TCTCGCTGCT CACCGGCGGC GCCGTGCGCT ACCGGCTATA CGGCCAGCAG GGCGTGGCCG CTGCGGAGAT CGCCAGGATC AGCCTGTTCG CCAGCCTGTC GCTGGGCTGC GCGCTGCCGC TGCTGGCCGC CGGCGCCGCG CTGCTCGACC TGCCGCAGGC GAGCCGCGCC CTGCATCTGG CCGCTGGCTC CCTGGCGACC CTGGCGGTGG CCGTGTTGTC CGGCTACGCG GCGCTGGCGC TGTTCCTCTG GCGCCACGCC ACGCCGGACG AACGGAACCC GCAGGTCATC CGCGCCAGCT TCGGTCCCTA CAGCCTGGGC CTGCCCGCGG CGCGCCTGAC CCTGCTGCAG TTGCTGATCG CCTGCGCCGA TGTGGTGGCC GCCGCCTTCG TGCTCCATGT CCTGCTGCCG GAGGGACCGT CCTTCGGCAG CTTCCTGCTG GTCTACCTGG TGGCGCTGGC CGCCGGCGTG CTCAGCCATG TGCCGGGCGG CATCGGCGTG TTCGAGGCGG TGCTGCTCGC CGCCTTCGGC CAGAGCCTCG GCACGGCGCC GCTGGCCGCC GCGCTGCTGC TCTACCGGCT GATCTACACG GGCTTGCCGC TGCTGCTGGC CTGCCTGCTG CTGCTCGGCA ACGAAGCCCG CCGGCTGACC CGCGCCGGCC TGCGCCTGGT CTCCGGCCTG GCCGCGCCGG TGCTGGCCCT GCAGGTGTTC ATCGCCGGCA TCGTGCTGCT GTTCTCCGGC GCCACGCCGA GCCTCGACGA GCGCCTGGAG CCGCTGAACT ACCTGGTGCC CCTGCAACTG ATCGAACTCT CGCACCTGGG CTCCAGCCTG ATCGGCGTGG TCTGCCTGCT GCTCGCCCAG GGCCTGCGCC GGCATCTCTC GGCGGCCTGG GCGCTGACCC TGGTGCTGCT GCTGGCCGCC GCCCTACTCT CGTTGCTCAA GGGCTTCGAC TGGGAGGAGG CCAGCGTGCT GATCGGCATC GCGGCGCTGC TGGCGCTGTT CCGCAAGGCC TTCTACCGCC CCAGCCGGCT GCTCGAACTG CCCGGCTCGC CGCCGATCAT GCTGGCGACC CTCGGCGTGC TGGTGGCCAG CACCTGGCTG CTGCTGTTCG TCTACCAGGA CCTGCCCTAC AGCCACACCC TGTGGTGGCA GTTCGAGCTG GACGGCAACG CCCCCCGCGG TCTGCGCGCC CTGCTGGGCA GCAGCCTGAC GCTGCTGGTG GTCGGCCTCG TCTGGCTGCT GCGCAGCCCG CCGCCGCCAC AGAACCTGCC CGGCCCCGCG GACCTCGAAC GGGCCTTCGC CGCGGTCCGC GCCTCGCAGC AGCCGGAGGG CGCGCTGGCG CTGTCCGGCG ACAAGGCACT GCTGTTCGAC CGCGAGCGCG ACGCCTTCCT CATGTACGCC CGGCGCGGGC ACAGCATGGT GGCGCTGTTC 
GACCCGGTGG GCACGCCGCA GCAGCGCGCC GAACTGATCT GGCAGTTCCG CGACCTGTGC GACCTGCACT ACCGGCGACC GGTGTTCTAC CAGGTGCGCG CCGAGAACCT GCCGCACTAC ATGGATATCG GCCTGATCGC CATCAAGCTG GGCGAAGAGG CGAAGGTCGA CCTTCGCCGC TTCGACCTCG ACAACCCCGG CAAGCACATG AAGGACCTGC GCTACACCTG GAACCGCTGC CGGCGCGACG GCCTCGACCT GGTGTTCCAC GAGCGCGGCC AGGCGCCGCT GGCCGAGCTG GAGGAGGTCT CCAGGGCCTG GCTGGCCGGC AAGACCGGCC GCGAGAAGGG TTTCTCCCTC GGTCGCTTCA GCCCGGACTA CCTGCAGTAT TTCCGCATCG TGGTGGCGCA CCATCAGGGT CGCCCGGTGG CCTTCGCCAA CCTGCTGGAA ACCGACAGCC CGGAAATCGC CGGCCTCGAC CTGATGCGCG TGCATCCGGA CGCACCGAAG CTGACCATGG AATTCCTCAT CCTCGGCATC CTGCTGCATT TCAAAGAGCG CGGCGGCGAC TTCTTCAGCC TCGGCATGGT GCCGCTCTCC GGCATGCTGC CGCGGCGCGG CGCGCCGCTG CCGCAACGCC TCGGCGCGCT GCTGTTCGAG AATTCCGAAT ACTTCTACAA CTTCCAGGGC CTGCGCCGCT TCAAGGAGAA ATTCGATCCG CAGTGGGAGC CGCGCTATCT CGCGGTGCCC GCCGGCATCG ATCCGCTGCT GGCCCTGGCC GACACCGCCG CGCTGATCGC CGGCGGCCTT TCCGGACTGG TGAAACGCTG A
|
Protein sequence | MDTLSKRSAV SVCIKALQAL AHRLRPLGEY RTVLASLFTL LVFGLALIAC RRLLDSLNPQ HLHEALLAIP GTDLALALLA TAAGFVALLG YEWSACRYAG VRLPPPSLLL GGFCAFAIGN ATGLSLLTGG AVRYRLYGQQ GVAAAEIARI SLFASLSLGC ALPLLAAGAA LLDLPQASRA LHLAAGSLAT LAVAVLSGYA ALALFLWRHA TPDERNPQVI RASFGPYSLG LPAARLTLLQ LLIACADVVA AAFVLHVLLP EGPSFGSFLL VYLVALAAGV LSHVPGGIGV FEAVLLAAFG QSLGTAPLAA ALLLYRLIYT GLPLLLACLL LLGNEARRLT RAGLRLVSGL AAPVLALQVF IAGIVLLFSG ATPSLDERLE PLNYLVPLQL IELSHLGSSL IGVVCLLLAQ GLRRHLSAAW ALTLVLLLAA ALLSLLKGFD WEEASVLIGI AALLALFRKA FYRPSRLLEL PGSPPIMLAT LGVLVASTWL LLFVYQDLPY SHTLWWQFEL DGNAPRGLRA LLGSSLTLLV VGLVWLLRSP PPPQNLPGPA DLERAFAAVR ASQQPEGALA LSGDKALLFD RERDAFLMYA RRGHSMVALF DPVGTPQQRA ELIWQFRDLC DLHYRRPVFY QVRAENLPHY MDIGLIAIKL GEEAKVDLRR FDLDNPGKHM KDLRYTWNRC RRDGLDLVFH ERGQAPLAEL EEVSRAWLAG KTGREKGFSL GRFSPDYLQY FRIVVAHHQG RPVAFANLLE TDSPEIAGLD LMRVHPDAPK LTMEFLILGI LLHFKERGGD FFSLGMVPLS GMLPRRGAPL PQRLGALLFE NSEYFYNFQG LRRFKEKFDP QWEPRYLAVP AGIDPLLALA DTAALIAGGL SGLVKR
|
| |
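
The gene sequence above is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The following is a minimal sketch of how the GC content field could be recomputed from such text; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example runs on only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown above.
fragment = "ATGGATACCC TGAGCAAACG"
print(len(clean_sequence(fragment)))  # 20
print(gc_fraction(fragment))          # 0.5
```

Passing the full gene sequence to `gc_fraction` should give a value that can be compared with the reported 71% GC content.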