Gene Apar_1350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1350 
Symbol 
ID8414239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1521616 
End bp1522827 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content47% 
IMG OID645022951 
Producthypothetical protein 
Protein accessionYP_003180365 
Protein GI257785148 
COG category 
COG ID 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.692835 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.661465 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTCTTT TTGGGAACAT TTCTCGTCGA AACTTTTTTA AGACGGGAGC AGCTAGTGTG 
GTGGTTGCTG GCGCAATTAG TATTGCGGAA GGTTGCTCTC ACCAAGCAGG TTTGGGAGAT
GTAGGAAAAC CTCTGGTACT TGATGAGTCT GCTGGCACTA ATGTGTTGGA CTCCTATAGT
TCAGCAGAAT ATTCTGTCCA GCCAAGCCAG ACATGGACAC TTCCTCTAGG AAGCGTGCTA
CACCCTGCTG ACGGTAATTG GATCCCCGTT ACTACGGCTG GCGCGTCTGC TACGCCAATG
GTGAAGGGTT CTGCTCTTTC GCTGACGTCT GGTCAAGTTG TTGATGTTGT TCCTACAGCT
CAGATGAACA ATACTACTGC GGTCATTTAC GATGTTCGTT GCTCCGATTC CGTTTACGCC
TGGGTTGAGG TGGATACCAC AACCTTTGAT TGGGAGCTTT TTGCTGCTCC ATTTTCTGAT
GGCAAGCTTA CAGGAGACGC AAAGGTACTT TACAAAGCTG ATAAAAACTG GGACCCTGCA
CCGTTTGCCT GTGGAGACGA TAAGGTAGTT TGGATTGTCC AGCCTTCTTC CTCGGGCGAG
AAAACCCGCG AGTCTTCTCA CTGCTACGTG TGGCGTGTGG GTGATTCTGA AGGAACAGAC
GCCGTTGAAT CACCAGGTAG ATTTGCTACT GCACCCTCTA TTTCCAAGGG CGTAGTTACT
CTTACTCCCC GAGTTCGTGC TTCCGAAGGA ACCTATTATG GCGTTACGGC ATATCTGCTG
GGAGATAACC TTAAAACTAA GGTTGACCAG CTAGTTATGC CTCAATCTGT TAAACCATTT
GCAGCAAGCC GTGTAGACGA TAAGTTTATT GTGTCCGTTG AAGCAAGTTA TGACTCAGGT
GGTCTTTTAG GCAAGATGGG AACGTACATC TTACCTGCTT CAGGAGAAAA TCCTTACATT
ATTGAGCGTG AACCTTATGC AATATCTTCA GGTAAAGGTA GCCTTTATAT AATTAAGAGT
CGTGCTTCTT ATGTGCTTGT TGATACGCAA AATCAAACGG CAAATTGGCT CTATTCGATG
GATAGAAGCG TTGATTATGG AGAATTTCCT GCTCGAGAGG GCAACTGCGA TTCTTTTGTA
ACCTTCTCAA CTGTTAAGGA CCCTACTAGC GGATATCCTG CCTCTGTGGC AGTTCGTGTA
TTTTCTTTGT AG
 
Protein sequence
MTLFGNISRR NFFKTGAASV VVAGAISIAE GCSHQAGLGD VGKPLVLDES AGTNVLDSYS 
SAEYSVQPSQ TWTLPLGSVL HPADGNWIPV TTAGASATPM VKGSALSLTS GQVVDVVPTA
QMNNTTAVIY DVRCSDSVYA WVEVDTTTFD WELFAAPFSD GKLTGDAKVL YKADKNWDPA
PFACGDDKVV WIVQPSSSGE KTRESSHCYV WRVGDSEGTD AVESPGRFAT APSISKGVVT
LTPRVRASEG TYYGVTAYLL GDNLKTKVDQ LVMPQSVKPF AASRVDDKFI VSVEASYDSG
GLLGKMGTYI LPASGENPYI IEREPYAISS GKGSLYIIKS RASYVLVDTQ NQTANWLYSM
DRSVDYGEFP AREGNCDSFV TFSTVKDPTS GYPASVAVRV FSL