Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0960 |
Symbol | |
ID | 8413831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 1081093 |
End bp | 1082736 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 645022548 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003179980 |
Protein GI | 257784763 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGAGA ACTCACTTAG CCGTCGTAAT TTTTTGAAGG CATCAGGTAT GGTAGCGGCT GGTGGTGGCG CATCTATGAT GTTGGCTGGC TGCAACTCTT CTACTGATAC TCCTCAGGCA TCCGATAAGA AAAAGATTCT TCGTTATGGT TCTGCAAACT CTAAGCAAGG TCTTGATATG CAGCGTGCTA ATAACTCACA GTCCGCATGC GTTGCAGATT CTGTATGTGA GTCTTTGCTT CGTTGGACTG AGGATAACGA GCTTGTTCCA TGTCTACTCA AAGAGACTCC AACCTTTGAG TCTGATGGTG TTACTCTTAA GTGCGAGCTT AAAGAGGGTA TCAAGTTTCA CGATGGTACT ACTTTGACCG CAAAGGATGT TAAGTACACC TTTGAGCGTA TGTTTAAGCC AGAGACTAAG GCTCTCTCCA CCAGCTTCTT TGACAAGATC AAGGGAGCTC CAGAGATGCT TGCCGGTAAG ACAACTGAGC TTGAGGGCCT GACTGTTCAG GATGATACTC ACTTCACCTT TACACTTTCT GAGCCTTACA TTGTTTTTGT AAGCGTTCTT GGAATCTCCT ATGCCTGCAT TTTCCCAGAG AAGGCATGCG AGGCTGCTGG TACTGATTGG GGTACTGGTA CTAACCTCAT TGGTACTGGT AAGTACAAGA TTAAGAGCAA CGATGACACC ACTGAGGTAG TTCTTGAGCG TTTTGATGAC TATCACGATG GCAAGCCTGC TCTGGATGAG ATTCGTATCC ACTACTACGA TGATTCCAAC ACTCAGTTAC TTGCATTTAA AAACAACGAT ATTGACTTCT GCGATCTTCC ATCTTCTTTG TATCAGCAAT ATTCAAGCGA TGCTGACGTT AAGGATCTTA TTACTGCATA TCAGACCCTT GGTGTTAACT TCATCAACCT CAACCTCAAA GAGGGCTTTG AGCTTACCGA TCCTAAGGTT CGCCAGGCGC TTTCCTTGGC AATTAACCGC CAGGAGCTTG TTGACAACAT TGCTTCAGGT AATGGTACCG TTGCAAGCGG CTGGCTTGCA CCATCAACTC CTGGTTATGA TAAGAGTGCT CCTGCTTTTG AGTACAATCC AGAGAAGGCT AAGTCTCTTC TTGCTGAGGC TGGTGTCACT GACCTTAAGC TTTCTGCAAA GGTTAGAAAA TCTTACGAGA AGCTCCTTGT TGCCGTTCAG GAGTACTGGA ACAAGATTGG TGTTACCCTT GATGTTCAGG TAGAGGACAA CGCAGTTTGG AATACTGACT GGGCTGCTGG TAACCTTCAG ATTACTACTC TTGGTTGGTA TCCACTGTTT GCTGATGGCG ATCAGCACCT TTACACCTAC TTCTACAGCA CTAATGCTGC AAAGAAGTCT TCCTTCTATA ATAGTTCCGA GTTTGACAAG CTTCTTGTTG AGGGTCGCTC TGAGTCTAAC AAGGATAAGA GAACAGAGGA TTACAAAGAC GCAGACAATC TGTTGACCCG TACAGACTTT GCGACACTTC CTCTGTACTG GCCAAAGAAT TCCTTTGTCG CTAAGAAGTA CGTCAAGAAC GCTAAGGTTG GAAACTTGAT TTATCACTTC TTTGATATTG ATATTGATAC ATCTAATTCC GATTACACCG TTCCAGAAGA CTAA
|
Protein sequence | MLENSLSRRN FLKASGMVAA GGGASMMLAG CNSSTDTPQA SDKKKILRYG SANSKQGLDM QRANNSQSAC VADSVCESLL RWTEDNELVP CLLKETPTFE SDGVTLKCEL KEGIKFHDGT TLTAKDVKYT FERMFKPETK ALSTSFFDKI KGAPEMLAGK TTELEGLTVQ DDTHFTFTLS EPYIVFVSVL GISYACIFPE KACEAAGTDW GTGTNLIGTG KYKIKSNDDT TEVVLERFDD YHDGKPALDE IRIHYYDDSN TQLLAFKNND IDFCDLPSSL YQQYSSDADV KDLITAYQTL GVNFINLNLK EGFELTDPKV RQALSLAINR QELVDNIASG NGTVASGWLA PSTPGYDKSA PAFEYNPEKA KSLLAEAGVT DLKLSAKVRK SYEKLLVAVQ EYWNKIGVTL DVQVEDNAVW NTDWAAGNLQ ITTLGWYPLF ADGDQHLYTY FYSTNAAKKS SFYNSSEFDK LLVEGRSESN KDKRTEDYKD ADNLLTRTDF ATLPLYWPKN SFVAKKYVKN AKVGNLIYHF FDIDIDTSNS DYTVPED
|
| |