Gene Information |
Locus tag | Avin_21410 |
Symbol | |
ID | 7761064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 2141114 |
End bp | 2141992 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643805032 |
Product | ABC transporter, substrate binding protein, family 3 |
Protein accession | YP_002799313 |
Protein GI | 226944240 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
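
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2141114
end_bp = 2141992

# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# One codon per 3 bp; the last codon is assumed to be the stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 879 0 292
```

Under these assumptions the result agrees with the Gene Length (879 bp) and Protein Length (292 aa) fields.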
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.334752 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCTTCA GGCACGCTCA ACCATCCTCC GGCCGTCCCG CGCTCAAGGC GCTGGCCGCT TTGTTCGGCG CGACGCTGCT GGCCGTCGCC GGCAGCGCGG CGCACGCCGA CGCGACGCTG GAGAAGATCC GCGAGCGCAA CAGAATCAGC GTCGGGGTGA TCCTCAGCGG GCCGCCGTTC GGCACCATCG ACCCGGTCAC CCGCGAGCAC CTCGGCTACA ACGTCGAGCT GGCCAAGGGC ATCGCCAAGA CGCTCGGCGT CGGGCTGGAA ACCGTCTCGG TGCTGGCGCC GAACCGCGTG CAGTTCCTGC AGCAGGGCAA GGTCGACATC CTGATCGCCA ACATGCAGTT GACCGAGGAG CGCACGCAGA TCCTCGACTA CGTGCCGACC CCCTATGAAG AAGTCGGCGG CGGCGCGCTG ATCCGCAAGG GTAGCGGCAT CGCGAACTGG GAAGACCTCA AGGGCAAGCC GGTGTGCGTG TCCCAGGGCA GCAACTTCAT CAAGCCGCTG GTGGAAACCT ACGGCGCCGA GATCAAGGCG TTCCGCAGCC AGTCGGAATC GCTGCTCTCG CTGCGCGGCA ACGGCTGCGT CGCCGCCGTG CACGTCGGCC CGACCATGCG CACTCTGCTC AAGGAACCGG AGTGGGCCGA CTACGAGCTT CCGCTGCCGA ACGACCTGAT CCCGTCGCCT TCGGTGATCT GGGTGCGCAA GGGCGAGAAG GACATCCAGG CCCGGCTCGA TGCCATCGTC CGCGACTGGC ACCGCAGCGG CTGGCTGCTC GAGGTCGGCG AGCGCAACGG CCTGCAGCCG TCGCAGGCGC TGCGCGACCT GCACGAGAAA TACCGCGACG CGGCCCCCCT GGCGGACAGC CGGCAATGA
|
Protein sequence | MFFRHAQPSS GRPALKALAA LFGATLLAVA GSAAHADATL EKIRERNRIS VGVILSGPPF GTIDPVTREH LGYNVELAKG IAKTLGVGLE TVSVLAPNRV QFLQQGKVDI LIANMQLTEE RTQILDYVPT PYEEVGGGAL IRKGSGIANW EDLKGKPVCV SQGSNFIKPL VETYGAEIKA FRSQSESLLS LRGNGCVAAV HVGPTMRTLL KEPEWADYEL PLPNDLIPSP SVIWVRKGEK DIQARLDAIV RDWHRSGWLL EVGERNGLQP SQALRDLHEK YRDAAPLADS RQ
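
As a rough illustration of how the protein sequence follows from the gene sequence, the sketch below translates short fragments copied from the start and end of the gene. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The `translate` helper is hypothetical, written for this example, and not a library function.

```python
# Codon table built in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    # Step through complete codons only; a trailing partial codon is dropped.
    codons = (seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# First 30 bp and last 9 bp of the gene sequence, copied from the record.
head = translate("ATGTTCTTCA GGCACGCTCA ACCATCCTCC")
tail = translate("CGGCAATGA")
print(head, tail)  # MFFRHAQPSS RQ*
```

The two fragments reproduce the first ten residues (MFFRHAQPSS) and the final two residues (RQ) of the listed protein, followed by the TGA stop codon.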