Gene Apar_0960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0960 
Symbol 
ID8413831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1081093 
End bp1082736 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content45% 
IMG OID645022548 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003179980 
Protein GI257784763 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGAGA ACTCACTTAG CCGTCGTAAT TTTTTGAAGG CATCAGGTAT GGTAGCGGCT 
GGTGGTGGCG CATCTATGAT GTTGGCTGGC TGCAACTCTT CTACTGATAC TCCTCAGGCA
TCCGATAAGA AAAAGATTCT TCGTTATGGT TCTGCAAACT CTAAGCAAGG TCTTGATATG
CAGCGTGCTA ATAACTCACA GTCCGCATGC GTTGCAGATT CTGTATGTGA GTCTTTGCTT
CGTTGGACTG AGGATAACGA GCTTGTTCCA TGTCTACTCA AAGAGACTCC AACCTTTGAG
TCTGATGGTG TTACTCTTAA GTGCGAGCTT AAAGAGGGTA TCAAGTTTCA CGATGGTACT
ACTTTGACCG CAAAGGATGT TAAGTACACC TTTGAGCGTA TGTTTAAGCC AGAGACTAAG
GCTCTCTCCA CCAGCTTCTT TGACAAGATC AAGGGAGCTC CAGAGATGCT TGCCGGTAAG
ACAACTGAGC TTGAGGGCCT GACTGTTCAG GATGATACTC ACTTCACCTT TACACTTTCT
GAGCCTTACA TTGTTTTTGT AAGCGTTCTT GGAATCTCCT ATGCCTGCAT TTTCCCAGAG
AAGGCATGCG AGGCTGCTGG TACTGATTGG GGTACTGGTA CTAACCTCAT TGGTACTGGT
AAGTACAAGA TTAAGAGCAA CGATGACACC ACTGAGGTAG TTCTTGAGCG TTTTGATGAC
TATCACGATG GCAAGCCTGC TCTGGATGAG ATTCGTATCC ACTACTACGA TGATTCCAAC
ACTCAGTTAC TTGCATTTAA AAACAACGAT ATTGACTTCT GCGATCTTCC ATCTTCTTTG
TATCAGCAAT ATTCAAGCGA TGCTGACGTT AAGGATCTTA TTACTGCATA TCAGACCCTT
GGTGTTAACT TCATCAACCT CAACCTCAAA GAGGGCTTTG AGCTTACCGA TCCTAAGGTT
CGCCAGGCGC TTTCCTTGGC AATTAACCGC CAGGAGCTTG TTGACAACAT TGCTTCAGGT
AATGGTACCG TTGCAAGCGG CTGGCTTGCA CCATCAACTC CTGGTTATGA TAAGAGTGCT
CCTGCTTTTG AGTACAATCC AGAGAAGGCT AAGTCTCTTC TTGCTGAGGC TGGTGTCACT
GACCTTAAGC TTTCTGCAAA GGTTAGAAAA TCTTACGAGA AGCTCCTTGT TGCCGTTCAG
GAGTACTGGA ACAAGATTGG TGTTACCCTT GATGTTCAGG TAGAGGACAA CGCAGTTTGG
AATACTGACT GGGCTGCTGG TAACCTTCAG ATTACTACTC TTGGTTGGTA TCCACTGTTT
GCTGATGGCG ATCAGCACCT TTACACCTAC TTCTACAGCA CTAATGCTGC AAAGAAGTCT
TCCTTCTATA ATAGTTCCGA GTTTGACAAG CTTCTTGTTG AGGGTCGCTC TGAGTCTAAC
AAGGATAAGA GAACAGAGGA TTACAAAGAC GCAGACAATC TGTTGACCCG TACAGACTTT
GCGACACTTC CTCTGTACTG GCCAAAGAAT TCCTTTGTCG CTAAGAAGTA CGTCAAGAAC
GCTAAGGTTG GAAACTTGAT TTATCACTTC TTTGATATTG ATATTGATAC ATCTAATTCC
GATTACACCG TTCCAGAAGA CTAA
 
Protein sequence
MLENSLSRRN FLKASGMVAA GGGASMMLAG CNSSTDTPQA SDKKKILRYG SANSKQGLDM 
QRANNSQSAC VADSVCESLL RWTEDNELVP CLLKETPTFE SDGVTLKCEL KEGIKFHDGT
TLTAKDVKYT FERMFKPETK ALSTSFFDKI KGAPEMLAGK TTELEGLTVQ DDTHFTFTLS
EPYIVFVSVL GISYACIFPE KACEAAGTDW GTGTNLIGTG KYKIKSNDDT TEVVLERFDD
YHDGKPALDE IRIHYYDDSN TQLLAFKNND IDFCDLPSSL YQQYSSDADV KDLITAYQTL
GVNFINLNLK EGFELTDPKV RQALSLAINR QELVDNIASG NGTVASGWLA PSTPGYDKSA
PAFEYNPEKA KSLLAEAGVT DLKLSAKVRK SYEKLLVAVQ EYWNKIGVTL DVQVEDNAVW
NTDWAAGNLQ ITTLGWYPLF ADGDQHLYTY FYSTNAAKKS SFYNSSEFDK LLVEGRSESN
KDKRTEDYKD ADNLLTRTDF ATLPLYWPKN SFVAKKYVKN AKVGNLIYHF FDIDIDTSNS
DYTVPED