Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_24440 |
Symbol | |
ID | 7761359 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2439906 |
End bp | 2441711 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643805329 |
Product | extracellular solute-binding protein |
Protein accession | YP_002799606 |
Protein GI | 226944533 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGGGT GGCCGAACGC ATCGCGCGAT ACGCGACGGG CCGGGTATCT CCCTATTCAT TTTTCCGATT ATCCCGACCC GGAAAAGCGC CCATCCGCTT CGCCGGGCGG CCATCCCCGC CCGGAGCAGG CCGAGGAGCC TGGATTTCCC GGTGGGCGAG CGCTAAGGAG CAACATGAGC ATTTTGCGTA AGCTGGTCGC CGGTGGCGCG GCGACCCTGG CCTTGCTGGG AACCGCCGCG GTGCAGGCCG GTGCGGCCAA GGACGAGAAA GACAGCCTGA TCTACCTGCA CTCGATCGAG CCGAAGACCT TCTACCAATG GTGGACGCAG GCGGAATACC CGCGCCGCCA GCTTCTCGAC GGCCTGATTT TCCTCGACGG GGAAGGCAAG CTGCACCCCT GGCTGGCCAA GAGCTGGAAA CAGGACGGCA CGGTATGGAC CTTCGACCTG CGCGACGACG TGGTGTTTTC CGACGGCTCG AAATTCAACG CCGAGACCGT GGTGAAGAAC GTCGAGTTCT GGCTCAAGGT TTCCACCTCG GTGCCGGACT CCTTCTTCAA GGAAGCCAAA GCCGTCGACG AATACAAGGT GGAAATCCAC ACCACCATTG CGCAGCCCTG GCTGGCCAAC CTCTTGTCGA GCGGCGGCTT CGCCATCAAC TCCAGCCCCT CGCTGGCCCG CGATCTCAAG GAAATCGGCG AAAACCCGAT AGGCAGCGGC CCCTTCGTGC TCAAGGAATG GAAGCGCGGC GAAGAGATCG TCCTGGTTCG CAACGAAAAC TACCGCTGGG GCCCGGAAAC GACGCATGGC GGCCCGGCCC ATCTGAAGAC CATCCACTGG AAGTTCGTGC CGGACGCCAA TGCGCGCTGG CTGGCGCTGG AAAAGGGCGA GGCCGACCTG ATCTACGACC CGCCTTCGGT CAAGTGGAAG GAAGCGACCG GTAAATACCC GACTTCGACC CGATACGCGC CGGGCCGGGG TCAGACGCTC TCGCTCAATA CCGAGTTCGG CCCCTTCGCG GACAAGCGCG TGCGCCAGGC CTTCGCCTAC GCCAGCAACC GCAAGAAAAT CGTCGAGACC CTGTTCCGTG GCTCGGCCCT CTACGAAGGC AACGGCGCCT ATTCGCGAAC CACGCCCGAT TACGTCGACC TGGACGATGC TTATCCCTAC GACCCGGACA AGGCCGTCGG GCTGCTGGAG GAAGCGGGCT ACACCCGGGT CAACGGCGAC GGCTTCCGCG TCGGGAAGGA CGGCAAGGTG CTGGAGGTCC TGTTCCCCGT GTACCCGACC ATCGTCAGCC CGGAAGGCTA TACCTCGCTG CAGGCGTTGC AGGCTGAAGC GAAGAAGGTC GGCTTCAAGA TCGACCTGAT CGCCCTGACC CCCACCGACC TGGCCGCCGG CCGCTATACC AAGCCGGACG AATACCACGT CTACCTGGGC TACTGGACCA TGTATGCGCC GACGGTGCTT TCCGTCAATT ATCGCCCCGA TGACGGTTCG GCTTCGGGGA CCATCTTCGG CCGGCAGAAC CTCAACCAGA TCCAGACCAC GGGGGGCTCG CCCAACCCGC ACAACCGCGT GCGTTCGAAG GACTGGAAAC TGCAGGAAGC CATCGTCGAG GCGCACCGCG AGCCGGACCC GCAGGCACGC CACGCGAAAC TGGCCGCCAT CCAGCAGCAC ATCAGCGACG AGGCGCTGGC GCTGGGTTTC TACACCTCCA CCTATAACCT GGTGGGCCAG AAATACCTGA GCGGCCTGAT CCACAACATC CATGGCCCGA TTTTCTACGC GTTGAAGAAA GACTAG
|
Protein sequence | MRGWPNASRD TRRAGYLPIH FSDYPDPEKR PSASPGGHPR PEQAEEPGFP GGRALRSNMS ILRKLVAGGA ATLALLGTAA VQAGAAKDEK DSLIYLHSIE PKTFYQWWTQ AEYPRRQLLD GLIFLDGEGK LHPWLAKSWK QDGTVWTFDL RDDVVFSDGS KFNAETVVKN VEFWLKVSTS VPDSFFKEAK AVDEYKVEIH TTIAQPWLAN LLSSGGFAIN SSPSLARDLK EIGENPIGSG PFVLKEWKRG EEIVLVRNEN YRWGPETTHG GPAHLKTIHW KFVPDANARW LALEKGEADL IYDPPSVKWK EATGKYPTST RYAPGRGQTL SLNTEFGPFA DKRVRQAFAY ASNRKKIVET LFRGSALYEG NGAYSRTTPD YVDLDDAYPY DPDKAVGLLE EAGYTRVNGD GFRVGKDGKV LEVLFPVYPT IVSPEGYTSL QALQAEAKKV GFKIDLIALT PTDLAAGRYT KPDEYHVYLG YWTMYAPTVL SVNYRPDDGS ASGTIFGRQN LNQIQTTGGS PNPHNRVRSK DWKLQEAIVE AHREPDPQAR HAKLAAIQQH ISDEALALGF YTSTYNLVGQ KYLSGLIHNI HGPIFYALKK D
|
| |