Gene Apar_1034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1034 
Symbol 
ID8413907 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1165500 
End bp1166852 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content41% 
IMG OID645022623 
Producttype II secretion system protein E 
Protein accessionYP_003180053 
Protein GI257784836 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGTCT TTGAACGCGT TGCTAAGCAA ATTGAGACGC AAGAAGAGAA TGACTCACAA 
CTTCAAAAAG AACTATTTCA GAAGGCCAGA AAGAAGTTAA AAGAAAAACT TGTTTCTCGA
TTAGGACTTT CCACGGTTTC TTCTCTTATT ACAGGTAGTG ATATCGATCG TGTAAGAGAT
GAACTAAGAG TTACCTGTGA AGCAATTGTT AATGAAGAAA AGGACGAACT CTTTAGCTCA
GTTGACTATG AGGAAACAAT TGAGCAGGTA ATTAATGAAG TGTGTGGTCT TGGTCCAATT
CAGCCGCTAC TGAATGATAA AGATGTTACC GAAATCATGA TTAATGGCTG CAACAATTTG
TTTTACGAGA AAAATGGTGA GCTATTTGAA TCAAAAACAG TCTTTGACTC GGAAGAACAG
ATTTTGATTG TTATGGATCG CATCTTATCT CCATTAGGTA GAAGACTGGA TAGAGTGAGT
CCTATTGTTG ATGCTCGACT TGAGAATGGG GACCGTGTTA ATGCAGTGGC AAGTCCTGTT
GCAGTGAATG GTACTTCAGT AACTATTCGC AGATTTACCG GAAAAATTAC TTCTTTGGAC
CGGCTGGTGC AAATGGAATC ATTGCCTCAA TGGCTTGCAA GGTTTTTGTC TTGGGCTGTG
AAATGCAGAC AGGGAATCGC GGTTGTTGGC GGCACGGGAT CTGGTAAGAC TACGCTTTTA
AATGCACTTT CATGTGAAAT TTCAAAGTCA GAAAGAATTG TCACTATTGA GGACTCAGCA
GAGCTTAAGT TTGATTCACA TCCAAATGTT GTCAGGCTAG AAGCTCGTCC TGCTTCGATA
GAAGGAACAG GAGAAATAAC AATTAGATCA TTGGTGAGAA ATGCACTTCG AATGAGGCCG
GACCGCATTG TCGTAGGAGA GGTAAGAGGC GAAGAGTGCA TTGATATGTT GCAGGCTATG
AATACAGGTC ATGACGGCTC TCTGACAACA TTGCATGCCG GAACTGCTCA AGAGGCAATA
TTGCGCTTAG TGTTAATGGC TCGTTTTGGA ATGGACTTAC CTGCAGAAAT CATAGAACAG
CAGATAGCAA CTGCACTTGA TTTGATAGTT ATGTCTCAAA GGTTTCCTGA TGGGAAACGG
TATGTGACCA GCGTTTCAGA AATCTCTTTG TCTTCGTCAG GATCAATCGA GGTACAAGAG
GTGGTGTCGT TTGACGTGCA GAAAAGAAGT TGGCTTTTTG TAAAAACTCC TTCTTTTATC
AATCGCGCAA TAAAAGAAGG ATTACTCAAT AAAAAGGAGG TGGCTTCATG GATGTCATTG
CTTCCACAAT CACAGGAGGT GCTTTTGGAG TAA
 
Protein sequence
MLVFERVAKQ IETQEENDSQ LQKELFQKAR KKLKEKLVSR LGLSTVSSLI TGSDIDRVRD 
ELRVTCEAIV NEEKDELFSS VDYEETIEQV INEVCGLGPI QPLLNDKDVT EIMINGCNNL
FYEKNGELFE SKTVFDSEEQ ILIVMDRILS PLGRRLDRVS PIVDARLENG DRVNAVASPV
AVNGTSVTIR RFTGKITSLD RLVQMESLPQ WLARFLSWAV KCRQGIAVVG GTGSGKTTLL
NALSCEISKS ERIVTIEDSA ELKFDSHPNV VRLEARPASI EGTGEITIRS LVRNALRMRP
DRIVVGEVRG EECIDMLQAM NTGHDGSLTT LHAGTAQEAI LRLVLMARFG MDLPAEIIEQ
QIATALDLIV MSQRFPDGKR YVTSVSEISL SSSGSIEVQE VVSFDVQKRS WLFVKTPSFI
NRAIKEGLLN KKEVASWMSL LPQSQEVLLE