Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_43730 |
Symbol | |
ID | 7763246 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4418973 |
End bp | 4420313 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643807228 |
Product | hypothetical protein |
Protein accession | YP_002801469 |
Protein GI | 226946396 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.959668 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTCAT CCCTGAAACG CAAGCTCGCC GTGGCCCTGC TGGCCCTCGG CGTCCACCAC GCCCTGCCCG GCAGCGCCCT GGCGGCGCCG AACGAGCAGT TCTTCCCGCT GGCCACCTAC CGGGTCGGCG CCTACGCCTC CAGCGGCATT CCGGTATGGG CCGGGATGAT CGACTACCTG CGCTACATCA ACGAGGTGGA GGGCGGCATC AACGGCGTCA AGCTGGTCTG GCAGGAATGC GAGACCGAGT GGACGGCGGA GAAGGGCATC GAGTGCTACG AGCGCTTCAA GAACGGCCTG GACGGCGCGC CGGTGGCGGT CTACCAGCCC AACGGCGCGC CGGCCGCCTA CGCCCTGGCC GACAAGGCCG CGGCCGACAA GATCCCGCTG ATCACCCTGG GCTACGGGCG CACCGAGGCC ACCGACGGCA GTGTGTTCCC CTACAACTTC CCGGTGATGC TGACCTTCTA CAGCGAGGCT TCGGCGGTGA TCGAGTACAT CGCCCGACGC GAGGGCGGCC TGGACAAGCT CAAGGGCAAG AAGATCGCCA CCGTCTACCA CGACTCCGCC TACGGCCGGG AAACCCAGGG GCCGCTGGCG CTGCTGGCCG AGAAGTACGG CTTCGAGAAC ATCCAGATCC CGGTGGCCGA TCCGGGCAAC GAGCAGTCCG CGCAGTGGCG CCAGGTGCGC CAGGCGAAGC CGGACTGGGT GTTCCTGCGC ACCTGGGGCG TGTCCACCCC GGTGGCGATC AAGACCGCGG CGCGCTTCGG CTTCCCGGTG GAGCGCATCA TCGGCGACAT CTGGGCCAGC TCCGACGAGG ACGTGCTGCC CACGGGCCCG GCCGGCAAAG GCTACCTGGC GCTCACCCCC TATCCGGGCG GCGCCGACTT CGAGATCCAC AAAAAGATCA AGGAGCACAT CCTCGACAAG GGCAAGAGCG ACCTCAAGGA CCCGAAGAGC TTCGGCAGCG TCTACTACAA CTCCGGGCTG GTCAACGCGG CCATCGCCGT CGAGGCGATC CGCGCCGGGC AGGCCAGATT CGGCAAGCGC CCGCTCGACG GCGACGAGAG CCGCTGGGGC CTGGAGCACC TGGACATCGA CGATGCCCGG CTCAAGGCCA TCGGTTTCCA CGGCCTGATG CAGCCGCTCA GGCTGTCCTG CTCGGACCAC GAGGGCGGCG GCGCGGCCAA GGTGCAGCAG TGGGACGGCA GCAGTTGGAA GCTGATCACC GACTGGGTGC AGGCCGACCG CCAGACCCTG CGTCCGCTGA TCGAAGCCAA GTCCGGCGTC TACGCCAAGG AAAAGGGCAT CGCGCCGCGC GACTGCCGCG CCGACGGCTG A
|
Protein sequence | MSSSLKRKLA VALLALGVHH ALPGSALAAP NEQFFPLATY RVGAYASSGI PVWAGMIDYL RYINEVEGGI NGVKLVWQEC ETEWTAEKGI ECYERFKNGL DGAPVAVYQP NGAPAAYALA DKAAADKIPL ITLGYGRTEA TDGSVFPYNF PVMLTFYSEA SAVIEYIARR EGGLDKLKGK KIATVYHDSA YGRETQGPLA LLAEKYGFEN IQIPVADPGN EQSAQWRQVR QAKPDWVFLR TWGVSTPVAI KTAARFGFPV ERIIGDIWAS SDEDVLPTGP AGKGYLALTP YPGGADFEIH KKIKEHILDK GKSDLKDPKS FGSVYYNSGL VNAAIAVEAI RAGQARFGKR PLDGDESRWG LEHLDIDDAR LKAIGFHGLM QPLRLSCSDH EGGGAAKVQQ WDGSSWKLIT DWVQADRQTL RPLIEAKSGV YAKEKGIAPR DCRADG
|
| |