Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0533 |
Symbol | |
ID | 8413387 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 618688 |
End bp | 619869 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 645022106 |
Product | Extracellular ligand-binding receptor |
Protein accession | YP_003179555 |
Protein GI | 257784338 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000000200352 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGGCACACA AGCTAACGCG ACGACGCTTT TTGGAGGTGG CCGGGACCAT TGCGGGGTCG TTTGCTTTAG CAGGCTGTTC GCCAAAGAAC CAAGACCCTA ACTCCATTTA CGTTGGCGTG ATGGGTCCAT TTACCGGCGA CGTTGCTCAG TATGGACTCG CTGTGCGAAG TGGCGTTCTT TTGTATCTTA AGGAGTTTAA CAAGAAGGGC GGCGTTAACG GTCGTCAGAT TGTTCCTGTT GTTGAGGACG AGAAGGGCGA TTCAACTGAG GCTATTCTTG TTTACAACAA GCTTCTTGAC GAGAATGTTT GTGCAATCCT TGGGGACGTT ACCTCAACGC CAACCATTGC ATTAGCACAG AAATCGGTAC TGGACAACAT TCCTTGTGTT ACTGCATCTG CTACAGCAGA AGATGTTGTA ACACACGGTA ATAACATGTT TCGTGCTACT GTTACGGACC CTTTTCAGGG CGTTGTTCTT GCAGAGTTTG CTAAAAAGCA GGGCTATCAG CGTGTGGGAA CTATCTTTAA CTCCGGTGGC GATTATGAGA TTGGTGTAAA TAACGCCTTT GTTCAGAAGG CTCGTGAGCT GGGCATTGAG GTTGTATCCC AGCAGGGCTA TCCAACGGGA GCTGTTGACT TTAATGCCCA GCTTACCAGC ATTATTGGTT TAGATCCTGA TGTTATCTTG GCTCCCAACT ATTATCAGGA TAACGGTAAG ATTGTGACAC AGGCTCGCCA GCTTGGCTGC AAGAAGCCTT TTATGGGTGC AGATGGTTGG GCAGGCATTA TTGGCGGTGA GCAGGATTAT GCTTCTGCTA CTGATCTTGA AGGATGCTTC TATTGCTCCG CATTTGTTGC TTCAAATCCT GATGAAAAGG TGCAGCATTT TGTTACTGCT TACACCGAGG AATACGGTGA GGCTCCAACT AATTTCTGTT CTTTGGGGTA TGACGCTGCA ATGATTCTCT GCTCCGCTCT TGAGGTTGTC GAGAAACATG GCTATACCTA TAACTCTGAT GCGTATAGAC AGGCAATTAT TGACGCTATT GCATCTGGTG TGGTCGAAGG TGTTACTGGA GCCCTTTCAT ATCAGGGAAC AGGTGACCCA GTGAAGTCCA CCTTGATTAT TACCTTTAAA GATGGCAAAG AGTCCATCTA TGACACAATC AATCCTTCAT AG
|
Protein sequence | MAHKLTRRRF LEVAGTIAGS FALAGCSPKN QDPNSIYVGV MGPFTGDVAQ YGLAVRSGVL LYLKEFNKKG GVNGRQIVPV VEDEKGDSTE AILVYNKLLD ENVCAILGDV TSTPTIALAQ KSVLDNIPCV TASATAEDVV THGNNMFRAT VTDPFQGVVL AEFAKKQGYQ RVGTIFNSGG DYEIGVNNAF VQKARELGIE VVSQQGYPTG AVDFNAQLTS IIGLDPDVIL APNYYQDNGK IVTQARQLGC KKPFMGADGW AGIIGGEQDY ASATDLEGCF YCSAFVASNP DEKVQHFVTA YTEEYGEAPT NFCSLGYDAA MILCSALEVV EKHGYTYNSD AYRQAIIDAI ASGVVEGVTG ALSYQGTGDP VKSTLIITFK DGKESIYDTI NPS
|
| |