Gene Apar_0422 details


Gene Information

Locus tag: Apar_0422
Symbol:
ID: 8413271
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 488470
End bp: 490290
Gene Length: 1821 bp
Protein Length: 606 aa
Translation table: 11
GC content: 45%
IMG OID: 645021990
Product: ABC transporter related
Protein accession: YP_003179444
Protein GI: 257784227
COG category: [V] Defense mechanisms
COG ID: [COG1132] ABC-type multidrug transport system, ATPase and permease components
TIGRFAM ID:
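
The coordinates and lengths listed above are mutually consistent, which a few lines of arithmetic can confirm. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (not stated on this page) and that the CDS ends with a single untranslated stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 488470
end_bp = 490290

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends in one stop codon,
# which is not translated into an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1821 0 606
```

Under these assumptions the result matches the listed gene length (1821 bp) and protein length (606 aa).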


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAATC GCGCAGGAGG AGCACGTCCT CAAAATATCG GACATACCAT TAGAGTGTTT 
GTTTCGTACT TGGGTCATGC TAAGAAACGC CTTATGTTGG TGGCACTTCT TGTTTCTATT
AGTGGTTTAG CTGCGCTCCT TGGCACCTAT ATGATTAAGC CTGTTGTTGC TGCGGTGGGC
AGGGGAGACG TAAACGCATT CACCAATCTC ATTGTGTTTA CGGCTGTTGT GTATGCAATT
GGTGCTCTTA CTAGTGTGGG TTACACGCAA ATCATGGTTC GCGCTGCTCA GAGGATTGTT
TTTGACATAA GACGAGACCT CTTTGAGCAT ATTGAGTCGT TGCCGTTAAG GTTTTTTGAC
CAGCGTACGC GTGGCGATAT CATGAGTTTC TTTACAAATG ACGTGGATAC TATTTCAGAA
GCGCTCAACA ATAGTTTTGC CAATCTTGTT CTGGCGTTTA TTCAGATGGT TGGTACACTT
GTGCTGCTCA TTGTTCTTAA TTGGCAGTTG ACGCTTATTA CTATTCTCTT TGATGTGTTG
ATTGTAGTAT ATGCGCGTTA TGCAAGTAAG CGTTCAGCTA AGCATTATTC AGTGCAACAA
AAGAGCCTTG GTGTTCTTAA TGGCTTTGCA GAGGAGACCA TTTCTGGACT TAAGGTAGTT
AAAGTCTTTA ATCATGAGGA TGCTAATTTT ACTGATTTTC AAAAGGTGAA TGAGGAGCTA
AGAAGCTCTG GAACAAGCGC TCAGTCATAC GCAGCCACGA TGGTTCCTAT TACTGTTTCT
TTGACTTACA TTAACTACGC AGTAGTTACC GTGGTTGGAG CACTTTTGGC TGTAAATGGA
CACGCTGATG TTGCAAGTCT TGCATCCTAC CTGGTTTTTG TTCGCCAGGC AGCCATGCCG
TTCAACCAGT TCACACAGCT CTCCAACTTC CTGCTCACTG CTCTTGCCGG TGCCGAGCGC
ATCTTTGCTG TTATGGAGAT GACTCCCGAG GTTGATGAGG GTAAGGTTAA ACTGCTTCGC
GTAAGTGGCT ACGAGTCAGG TGACGAATCT TGGGCATGGG TTAAACCATC TGGCGAGAAA
ATCCCTCTTG CTGGAGACGT ACGCTTTGAT GAGGTAACTT TTGGTTACGA CGAGGGACAG
ACGGTTCTTG CTAACCTTTC TTTGTTTGCA AAACCTGGCC AGAAGATTGC CTTTGTTGGT
TCAACAGGTG CTGGTAAAAC AACTATTACC AATCTTATTA ACCGCTTTTA TGATGTTCGC
AGTGGCACTA TCACTTATGA TGGCATTCCT ATTAACGATA TTAAGAAGTC AGCACTTCGC
TCTTCTTTGG GCATTGTATT GCAAGACACG CACCTGTTTA CCGGTACCAT TGCAGATAAT
ATCCGTTTTG GTAAGTTAGA TGCTACCGAT GACGAAGTTA TTGCTGCAGC AAAGATTGCA
AACGCGGACA AGTTTATTCG CCGCATGCCT CGTGGTTACA ACACTGTGCT TACTTCCGAC
GGTGCTAATC TTTCTCAAGG TCAACGACAG CTTCTGGCTA TTGCACGTGC TGCTATTGCT
GATCCTCCCG TACTTATTCT AGATGAAGCA ACTTCCAGCA TTGATACGCG TACTGAGGCA
CTTATCGAGA AGGGCATGGA TGCCCTTATG GCAGGTCGTA CCGTCTTTGT AATTGCTCAT
CGCTTATCTA CGGTACGTAA TGCAAATGCC ATTATGGTAC TTGAACATGG ACGCATTATT
GAGCGCGGCA CTCATGATGA GTTGATTGCT CAGCATGGTG AGTATTATCA GCTTTATACC
GGCTCACGTG AACTTGAGTA G
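
The GC content figure in the gene information can in principle be recomputed from the gene sequence. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the full sequence; `gc_content` is a hypothetical helper, not part of any tool behind this page.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks that group the sequence into blocks of 10.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Demonstration on the first two 10-base blocks of the gene sequence above.
print(gc_content("ATGGCTAATC GCGCAGGAGG"))
```

Passing the full gene sequence instead of the two-block excerpt gives the value to compare against the rounded percentage in the record.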
 
Protein sequence
MANRAGGARP QNIGHTIRVF VSYLGHAKKR LMLVALLVSI SGLAALLGTY MIKPVVAAVG 
RGDVNAFTNL IVFTAVVYAI GALTSVGYTQ IMVRAAQRIV FDIRRDLFEH IESLPLRFFD
QRTRGDIMSF FTNDVDTISE ALNNSFANLV LAFIQMVGTL VLLIVLNWQL TLITILFDVL
IVVYARYASK RSAKHYSVQQ KSLGVLNGFA EETISGLKVV KVFNHEDANF TDFQKVNEEL
RSSGTSAQSY AATMVPITVS LTYINYAVVT VVGALLAVNG HADVASLASY LVFVRQAAMP
FNQFTQLSNF LLTALAGAER IFAVMEMTPE VDEGKVKLLR VSGYESGDES WAWVKPSGEK
IPLAGDVRFD EVTFGYDEGQ TVLANLSLFA KPGQKIAFVG STGAGKTTIT NLINRFYDVR
SGTITYDGIP INDIKKSALR SSLGIVLQDT HLFTGTIADN IRFGKLDATD DEVIAAAKIA
NADKFIRRMP RGYNTVLTSD GANLSQGQRQ LLAIARAAIA DPPVLILDEA TSSIDTRTEA
LIEKGMDALM AGRTVFVIAH RLSTVRNANA IMVLEHGRII ERGTHDELIA QHGEYYQLYT
GSRELE
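
The protein sequence can be related to the gene sequence by codon-wise translation. This is a minimal sketch, assuming the sequence is read in frame from its first base. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which the sketch does not model. `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Amino acids for the 64 codons in TCAG order, the layout used in NCBI's
# genetic code tables. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence in frame, stopping at the first stop codon."""
    # Remove the spaces and line breaks used for display.
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT input
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 21 bases of the gene sequence above.
print(translate("ATGGCTAATC GCGCAGGAGG AGCACGTCCT"))  # prints: MANRAGGARP
print(translate("GGCTCACGTG AACTTGAGTA G"))           # prints: GSRELE
```

The two excerpts reproduce the first ten and last six residues of the protein sequence listed above.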