Gene Information |
Locus tag | Avi_5381 |
Symbol | |
ID | 7381487 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | + |
Start bp | 382841 |
End bp | 384415 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643648995 |
Product | ABC transporter substrate binding protein (dipeptide) |
Protein accession | YP_002547232 |
Protein GI | 222106441 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
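The length fields in the table above follow from the coordinates. The sketch below reproduces that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(382841, 384415)
print(length)                  # 1575
print(protein_length(length))  # 524
```

Both results agree with the Gene Length (1575 bp) and Protein Length (524 aa) fields of this record.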
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.121189 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCAA CACAGACTTT AAGGCACCTT GCTGTGGTTT TGGCGATGGG AACCGCCATT GTCGCGCCAC ACGTGGCCAT GGCAAAAACC TCCACCGTGA TGAATGTAAC GCAGGTGTTT GGCACCATCG ATCCGGCCAA GATCACCGAT TACACTCAGT ATCTGGCTGC TGTGAACCTT TACGATGGCC TGACCACCGT GGACAGCACC GGCAAGATCA TCCCGGAACT GGCCGAGAGC TGGGATGTGT CCAGCGATAA CCTGACCTAC ACCTTCCATC TGCGCAAGGA CGCAACATTT CAGGATGGCT CTCCGGTTGA GGCCAAGGAT GTTGTTTATA CCGTCCAGCG CTTGCTGGCG ATCAACAAGG GACCGGCCTA TCTGTTTGCA ACGCTGATCA ACCCCGACAA TGTGAAGGCG GTGGATGCTC ACACCGTGAC CATCACACTC AACAAGGTCT ATGCGCCGTT CCTGACCACC ACGCCGCTGC TGCTTGTCAT CAATGAGGAT GCCGTCAAGG CCGCTTCCAA GCAGCCCTGG GGTGAGGATG TTGTTGGCGA AAAATCCATG GGCGCGGGCG CTTATGTGCT GTCCAGCTGG CAGCGCGGCT CGGAAATGGT CATCAGCCGC TATGAGAAAT ATTACGCGGG CTGGCCAACA AACCCGATTG ATGAAGTGCG CTTTGTGCAG ACCAATGACG AGGCAACCGT CAAGGCGCTG GCGACATCTG GCCAATTGGG CATTTCCTCC ACCACCCAGG CCAACGAGAC CTATGATGCT CTGGCCAAAA CCGACGGTTA TGTGGTCCAG ACCACACCAA CGGCCACCGG TTTTTATTTG AAACTCAACA CCAAAGCCAC GCCGACAGAC GATGTGCATG TGCGTCGCGC CTTGCAATAT GCCACCGATT ACAAAACCAT CCAGACACAG ATTATGACCG GTGACACGCT GGCGGGGCCG CTGGCGCAGG TGTTCAAGGA TGCCTATCTC GATACGCTGA AAGCGCCGGA ATTTGATCTT GAAAAGGCCA AGGCCGAGCT TGCCCAATCC AAATATGCGG GAAAGCCGAT CAAGCTGACG CTGACCTATG TGGCAGGCCT GTCGTTTGAA GAGGATATCG CACTTCTGAT GCAATCCAAT CTTTCGCAGA TCGGCGTGGA TGTGGACATC AAACCAGAGC CCTGGAACCG CATCACCGAA CTGGCGGCCA AGCCGGAAAC AACGCCTGCG GCCACGCAAG TGTTCTATGG CCCAACCTAT CCCTCGCCAG ATAGCGTGTT CTACGTGCAA TATCATTCCA AATCGGCGGG AACATGGGCC TCCATGGAAT GGTTGCAGGA TGCCGAGGTT GATAAATTGA TCGATGAGGC CCGCTCCACC ACCGACAGTG CCAAGCAGAA TGCGATCTAC AAGCAGATCC AGCAGGCGAT TTCTGATGAG GCACCGGACG TGAATTTGCT GACGAAAGTG CAGAAGGTGG CCTTCAGCAA GTGCATCTCG GGCTATAAAT TTGTGCCAAT GCAGAGCTGG GATTACAATT TCCACGATCT GACATGGACG TGCCCGGCCA AATAA
|
Protein sequence | MNATQTLRHL AVVLAMGTAI VAPHVAMAKT STVMNVTQVF GTIDPAKITD YTQYLAAVNL YDGLTTVDST GKIIPELAES WDVSSDNLTY TFHLRKDATF QDGSPVEAKD VVYTVQRLLA INKGPAYLFA TLINPDNVKA VDAHTVTITL NKVYAPFLTT TPLLLVINED AVKAASKQPW GEDVVGEKSM GAGAYVLSSW QRGSEMVISR YEKYYAGWPT NPIDEVRFVQ TNDEATVKAL ATSGQLGISS TTQANETYDA LAKTDGYVVQ TTPTATGFYL KLNTKATPTD DVHVRRALQY ATDYKTIQTQ IMTGDTLAGP LAQVFKDAYL DTLKAPEFDL EKAKAELAQS KYAGKPIKLT LTYVAGLSFE EDIALLMQSN LSQIGVDVDI KPEPWNRITE LAAKPETTPA ATQVFYGPTY PSPDSVFYVQ YHSKSAGTWA SMEWLQDAEV DKLIDEARST TDSAKQNAIY KQIQQAISDE APDVNLLTKV QKVAFSKCIS GYKFVPMQSW DYNFHDLTWT CPAK
|
| |
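The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a display string could be cleaned and sanity-checked is shown below. The helper names are hypothetical, and the start-codon check is a deliberate simplification: it accepts only ATG, although translation table 11 also permits alternative start codons.

```python
# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used for 10-character display blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, ATG start (simplification), table 11 stop."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# First two display blocks of the gene sequence in this record.
head = clean_sequence("ATGAACGCAA CACAGACTTT")
print(len(head), gc_fraction(head))
```

Applied to the full gene sequence, `gc_fraction` can be compared with the GC content field, and `len(clean_sequence(...))` with the Gene Length field.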