Gene Apar_0533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0533 
Symbol 
ID8413387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp618688 
End bp619869 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content47% 
IMG OID645022106 
ProductExtracellular ligand-binding receptor 
Protein accessionYP_003179555 
Protein GI257784338 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000200352 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGCACACA AGCTAACGCG ACGACGCTTT TTGGAGGTGG CCGGGACCAT TGCGGGGTCG 
TTTGCTTTAG CAGGCTGTTC GCCAAAGAAC CAAGACCCTA ACTCCATTTA CGTTGGCGTG
ATGGGTCCAT TTACCGGCGA CGTTGCTCAG TATGGACTCG CTGTGCGAAG TGGCGTTCTT
TTGTATCTTA AGGAGTTTAA CAAGAAGGGC GGCGTTAACG GTCGTCAGAT TGTTCCTGTT
GTTGAGGACG AGAAGGGCGA TTCAACTGAG GCTATTCTTG TTTACAACAA GCTTCTTGAC
GAGAATGTTT GTGCAATCCT TGGGGACGTT ACCTCAACGC CAACCATTGC ATTAGCACAG
AAATCGGTAC TGGACAACAT TCCTTGTGTT ACTGCATCTG CTACAGCAGA AGATGTTGTA
ACACACGGTA ATAACATGTT TCGTGCTACT GTTACGGACC CTTTTCAGGG CGTTGTTCTT
GCAGAGTTTG CTAAAAAGCA GGGCTATCAG CGTGTGGGAA CTATCTTTAA CTCCGGTGGC
GATTATGAGA TTGGTGTAAA TAACGCCTTT GTTCAGAAGG CTCGTGAGCT GGGCATTGAG
GTTGTATCCC AGCAGGGCTA TCCAACGGGA GCTGTTGACT TTAATGCCCA GCTTACCAGC
ATTATTGGTT TAGATCCTGA TGTTATCTTG GCTCCCAACT ATTATCAGGA TAACGGTAAG
ATTGTGACAC AGGCTCGCCA GCTTGGCTGC AAGAAGCCTT TTATGGGTGC AGATGGTTGG
GCAGGCATTA TTGGCGGTGA GCAGGATTAT GCTTCTGCTA CTGATCTTGA AGGATGCTTC
TATTGCTCCG CATTTGTTGC TTCAAATCCT GATGAAAAGG TGCAGCATTT TGTTACTGCT
TACACCGAGG AATACGGTGA GGCTCCAACT AATTTCTGTT CTTTGGGGTA TGACGCTGCA
ATGATTCTCT GCTCCGCTCT TGAGGTTGTC GAGAAACATG GCTATACCTA TAACTCTGAT
GCGTATAGAC AGGCAATTAT TGACGCTATT GCATCTGGTG TGGTCGAAGG TGTTACTGGA
GCCCTTTCAT ATCAGGGAAC AGGTGACCCA GTGAAGTCCA CCTTGATTAT TACCTTTAAA
GATGGCAAAG AGTCCATCTA TGACACAATC AATCCTTCAT AG
 
Protein sequence
MAHKLTRRRF LEVAGTIAGS FALAGCSPKN QDPNSIYVGV MGPFTGDVAQ YGLAVRSGVL 
LYLKEFNKKG GVNGRQIVPV VEDEKGDSTE AILVYNKLLD ENVCAILGDV TSTPTIALAQ
KSVLDNIPCV TASATAEDVV THGNNMFRAT VTDPFQGVVL AEFAKKQGYQ RVGTIFNSGG
DYEIGVNNAF VQKARELGIE VVSQQGYPTG AVDFNAQLTS IIGLDPDVIL APNYYQDNGK
IVTQARQLGC KKPFMGADGW AGIIGGEQDY ASATDLEGCF YCSAFVASNP DEKVQHFVTA
YTEEYGEAPT NFCSLGYDAA MILCSALEVV EKHGYTYNSD AYRQAIIDAI ASGVVEGVTG
ALSYQGTGDP VKSTLIITFK DGKESIYDTI NPS