Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_18790 |
Symbol | |
ID | 7760813 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 1863283 |
End bp | 1865016 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643804777 |
Product | tetratricopeptide (TPR) repeat and VWA domain-containing protein |
Protein accession | YP_002799066 |
Protein GI | 226943993 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00254687 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATCC TCTGGCCGCA CTGGCTGCGC CCCGACTGGT TGCTCCTGCT GCCCCTGCTC GCCTGGCTGC TCTGGCGGCT GTGGCGCCGC GAACGGCGCG CCGGACGCTG GGAACTGCTG CTGCCGCCGG CCTTTCACCA GGCCCTGCTC GGCGCGCGCA GCGGACGCGG CAGCCGCCTG CCGTGGATCG CGCTCGGGCT GGGCTGGCTG CTCGCCCTGC TGGCCCTGCT CGGGCCGAGT TGGCAACGCA TCGAACAAAG CCCCCTGAAG CGCGCCGATC CGCTGGTGGT ACTGCTCGAA CTGACCCCGA GCATGCTCGC CGGCGACGTC GCCCCGAACC GCCTGGCGCT AGCCCGGCAC AAGCTGCTCG ACCTGCTCGA GGCGCGCCAG GAGGCGCAGA CCGCGGTGGT CGTCTACGCC GGCAGCGCAC ACACCCTGGT GCCGCTGTCC GACGACCTGG AAACCACCCG CAATCTGCTG GACGCCCTCG CCCCGCCGCT CATGCCGGTT GCCGGGCGGC GCGCCGACCT CGCCGTGGCC CGCGGCCTGG CCCTGCTCGA ACAGGGTGCC CAGGGCCGTG GCCAGTTGTT GCTGATCGGC AGCGAACTGG ACGAACGGGA ACGCCAGGGC ATCGCCCAGG CGCTCGGCGA CGACGGCGAG CGCCTCGCCA TCCTCGGCAT CGGCAGCCCT GACGGCGCGC CCATCGCCCT GGAGGACGGC AGCTTCCTCA AGGACGGACA GGGCGCGATC CTCCTCGCCC GGCTGGACAG CGGCGGACTC GCGCGCTTCG CCGACAGCCT CGGCGGGCGC TACCAGAGCG CCGGCCTGGA CGACCGCGAC CTGCAACGGC TGGGCCTGCT CCAGGGGCCG CGACTGCTGC GCGAGGGCGA CCAGACGACA CGCCTGGACG CCTGGGCCGA CCAGGGCCAC TGGCTGCTGC TGCCGCTGCT CCTGCTGGCC GCCTGCGCCG GCCGGCGCGG CTGGCTGCTC GCCCTGCCCC TGCTGTTCGT CCTGCCCCAG CCCGGCCAGG CCTTCGAGCT CACCGACCTG TGGCTGCGCC CCGACCAGCA GGGCCGCATC CTGCTCCAGG CCGGCCGGCC GGGCGAGGCC GCGCGGCGCT TCGAAGACAG CCAGTGGCAG GGCCTGGCCC TCTACCAGGC CGGCGACTAC GCCGCCGCCG CCGAGCGCTT CGCCCAGGGC CAGGGGGCCG CCGCCCACTA CAACAGCGGC AACGCCCTGG CTCGGGCCGG CGAACCGGAG GCCGCGCTGG ACGCCTACGA GCGTGCCCTG GAACTGCAGC CGGCGCTGGA AGCGGCGCAG CACAACAAGG CGCTGGTCGA GGCGCTGCTG CGCCAGCGGC AGGCGCGTCA GCCCGACGCC GATGGCAGTG CCGAGCGGCA GCAGGCCCCG CCGAACCGGG ACAGCCCGTC CGGCCAGGCT GGGCAGAACG ATATCCGGGC GGACACGACG GCCTCCCAGG CCGCAGCGGA GGCAGCCCCG GACGAAGCGA CGGCGCAGCC GGGCACGCCC GGCTCGCTAT CGGCCGAGGG ACAGACCGCG GGCGATACGG CGCCGGGCCC CGCCGCGGCC GCCGGAGCCG TGTCCGCGCC CGTCACCGAG GAACGCCGCC AGGCCCTGGA ACAATGGCTG CGGCAGATCC CCGACAATCC CGGAGAATTG CTGCGGCGCA AATTCTGGTA CGAACAGCAA CAGCACCGGG AAACCAGCCC ATGA
|
Protein sequence | MNILWPHWLR PDWLLLLPLL AWLLWRLWRR ERRAGRWELL LPPAFHQALL GARSGRGSRL PWIALGLGWL LALLALLGPS WQRIEQSPLK RADPLVVLLE LTPSMLAGDV APNRLALARH KLLDLLEARQ EAQTAVVVYA GSAHTLVPLS DDLETTRNLL DALAPPLMPV AGRRADLAVA RGLALLEQGA QGRGQLLLIG SELDERERQG IAQALGDDGE RLAILGIGSP DGAPIALEDG SFLKDGQGAI LLARLDSGGL ARFADSLGGR YQSAGLDDRD LQRLGLLQGP RLLREGDQTT RLDAWADQGH WLLLPLLLLA ACAGRRGWLL ALPLLFVLPQ PGQAFELTDL WLRPDQQGRI LLQAGRPGEA ARRFEDSQWQ GLALYQAGDY AAAAERFAQG QGAAAHYNSG NALARAGEPE AALDAYERAL ELQPALEAAQ HNKALVEALL RQRQARQPDA DGSAERQQAP PNRDSPSGQA GQNDIRADTT ASQAAAEAAP DEATAQPGTP GSLSAEGQTA GDTAPGPAAA AGAVSAPVTE ERRQALEQWL RQIPDNPGEL LRRKFWYEQQ QHRETSP
|
| |