Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_43720 |
Symbol | |
ID | 7763245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4417541 |
End bp | 4418881 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643807227 |
Product | hypothetical protein |
Protein accession | YP_002801468 |
Protein GI | 226946395 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.35207 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGACT CCCTGCGCCG CGTCCTCGCC GGCGCCGCCC TCGCCCTCGG CGCCCATACC GCCCTGCCCG CGACCGCCCT GGCGGCGCCG GACGAGCAGT TCGTTCCGCT GGCCACCTAC CGCGTCGGCC CCTATGCCTC CAGCGGCATT CCCTGGTGGG CCGGCACCAT CGACTACCTG CGCTACGTCA ACGAGGTGGA GGGCGGCATC AATGGCGTGA AGATCGCCTG GCAGGAATGC GAGACCGAAT GGAGCACCGA CCGCATCGTC GAGTGCTACG AGCGTTTCAA GAACGGCCGC GACGGCGCGC CGGTGGCCTT CTTCCTCACC CACAGCACCC CGGCCTCCTA CGCGCTGATG GAGAAGGGCG CGGCGGACAA GATCCCGCTG ATCGACCCGG CCGGCGGGCG CACCGAATCC ACCGACGGCA GCGTGTTCCC TTACGCCTTC CCGCTGCTGC TCACCTACTA CGGCCAGGCT TCGGTGGGCA TCAACTACAT CGCCCAGCGC GAAGGCGGCT TCGACAAGCT CAAGGGCAAG AAGATCGCCA CCGTCTACCA CGACTCGCCC TACGGCCGGG AAACCCAGGC GGCAATCGAG TTGCTGGCGC AGAAGTACGG CTTCGAGAAC ATCCAGATCC CGGTGGCCCA CCCCGGCAAC GAACAGTCGG TGCAGTGGCG CCAGGTGCGC CAGCTCCAGC CGGACTGGGT GTTCCTGCGC ACCTGGGGCG TGTCGACCCC GGTGGCGATC AAGACCGCCG CGCGCTTCGG CTTCCCGGTC GAGCGCATCA TCGGCGACGT CTGGGCCGGC TCCGAGGCCG ACGTGATCCC GGCGGGCGCC GCCGCCAAGG GCTACCAGGC GCTGGCGCCG TTCCCCGGCG GCGACGGCTT CGAGATCCAC AAGCGCCTGC GCGAGCAGAT CCTCGACAAG GGCAAGAGCG ACCTCAAGGA CCCGAGTTAC TTCGGCAAGG TCTACTACAA CATCGGCCTG GTCAACGCGG CCATCGTGGT CGAGGCGCTG CACGCCGGTC AGGCGAAGTT CGGCAACCGG CCGCTGAACG GCGAGGAAGG GCGCTGGGCC TTCGAGCACC TCAAGGTCGA CGCGGCGCGC CTGAAGCAGA TCGGCTTCGA CGGCCTGCTG CAACCCCTGC AGGTGAGCTG CGCGGACCAC GAAGGCGGCG GCGCCGCCCG CGTGCAGCAA TGGGACGGCG AGAAATGGGT GCTGGTCAGC GACTGGGTGC AGGCCGACCG CGACACCCTG CGGCCGCTGA TCGAGGCCAA GTCCGCCGCC TACGCGAAGG AGAAAAGCAT CACCCCGCGC GATTGCGCCA CGGCGAACTG A
|
Protein sequence | MRDSLRRVLA GAALALGAHT ALPATALAAP DEQFVPLATY RVGPYASSGI PWWAGTIDYL RYVNEVEGGI NGVKIAWQEC ETEWSTDRIV ECYERFKNGR DGAPVAFFLT HSTPASYALM EKGAADKIPL IDPAGGRTES TDGSVFPYAF PLLLTYYGQA SVGINYIAQR EGGFDKLKGK KIATVYHDSP YGRETQAAIE LLAQKYGFEN IQIPVAHPGN EQSVQWRQVR QLQPDWVFLR TWGVSTPVAI KTAARFGFPV ERIIGDVWAG SEADVIPAGA AAKGYQALAP FPGGDGFEIH KRLREQILDK GKSDLKDPSY FGKVYYNIGL VNAAIVVEAL HAGQAKFGNR PLNGEEGRWA FEHLKVDAAR LKQIGFDGLL QPLQVSCADH EGGGAARVQQ WDGEKWVLVS DWVQADRDTL RPLIEAKSAA YAKEKSITPR DCATAN
|
| |