Gene Apar_0911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0911 
Symbol 
ID8413779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1019509 
End bp1020807 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content45% 
IMG OID645022496 
Producthypothetical protein 
Protein accessionYP_003179931 
Protein GI257784714 
COG category[S] Function unknown 
COG ID[COG0392] Predicted integral membrane protein 
TIGRFAM ID[TIGR00374] conserved hypothetical protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00529353 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.444552 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCGCC CAGATGGCGC GTCTGAAACC GAGCACCCCT CATCTTCATC TCGTGTGACG 
GTAAAGATTG GATCTACAAC CGTCAAGGGT GTTTCAACTA AAAAACCTAA GATTAGTGTT
GCTCCTACTG CTTCATCTGA ACAGGTAGAA GAGGCAAAAG AGAGTTTTGG TCAGGTTAGA
AAAAGTATCC TCTTCCTCTT TGCGGTAGTA ATACTTTATG TACTCTACAT TGTTTTCTCT
GGTCAGTTTG ATGAGTTTGT TGTTGCCCTA GCTGACGTAG ACACAGGATG GCTTATTGCT
GGTGCTCTTT GCTATTTTGT CTACTATTTC CTTGGTATTC TCGCTTATGT GATTTCGGTT
ATTACAGATC CTGAGAGTCC CGTTGGCGTT AGAGATTTGA TGAGCGTTGA GGCGTCGGGC
ATCTTCTTCA TGAATTTAAC GCCAAACGGC GCAGGTGCTG CTCCTGCTCA GATTTACCGT
TTGACTCGTG CGGGTATTTC TGTTGGCCAA GCAGGAGCCT TGCAGTTTAC ACGCTTTGTT
ATGTATGAGG CAGGCGAGGG AATCTTTGCA GCTCTTATGC TTATTTTTAG GGCTAATTAT
TTCTATGAGC AATTTGGCGA CGTAACGGTT ATTGGCGTTA TTTTATTTGG TGCAAAGATA
GTGATGGTTG CAGTTATCTT AGCTATCTGC CTGCTTCCTA AAATGGTAAA GTCAGTGGGA
AACTGGGGCT TAAGATTGCT CTCTAAGCTA CGCATTGTAA AGCGCTACGA TCACTGGCAT
GGCATTATTA ACACACAGGT AGACGAGTTC TCAAACGGCT TTAAGACTGC CGCAAAGAAT
ATCCCAGAAA TGTGTATTGT CTTGCTGGTC ACCTTAGTTC AGCTTGGCTG CCTCTATGCG
CTTCCATACT TTGTTTTGCG TGCACTTGGG CAACCAGCTG ATCTTTTGAC CTGCCTTGCT
TCTGGTTCAA TGCTTGAGCT CTTGACTTCG GCTATTCCGC TTCCTGGTGG TACTGGTGGT
GCTGAGGGTG GCTTTGCCTT TTTGTTTGGT CATATGTTTG GTGAGAAAAT TGCCGCAGGC
TTTGTACTCT GGCGCGCTAT CGAATATCTG CTTCCAACTC TGGTAGCAAG CATGCTTTTG
GGGCTCAGAT CTCATGATCA TGAGCCTATA TACCATAAGT GGAATCGCTT CCGCCAGCGT
TTCTCTGCCT TTGTAAATGG CGAGAAACCC GCTGCTGCTT CTACGTTACC TCGTCCAGAT
ACATCTGGTA TCCAAATTAA GGTTAAGCGT AAGAAATAG
 
Protein sequence
MDRPDGASET EHPSSSSRVT VKIGSTTVKG VSTKKPKISV APTASSEQVE EAKESFGQVR 
KSILFLFAVV ILYVLYIVFS GQFDEFVVAL ADVDTGWLIA GALCYFVYYF LGILAYVISV
ITDPESPVGV RDLMSVEASG IFFMNLTPNG AGAAPAQIYR LTRAGISVGQ AGALQFTRFV
MYEAGEGIFA ALMLIFRANY FYEQFGDVTV IGVILFGAKI VMVAVILAIC LLPKMVKSVG
NWGLRLLSKL RIVKRYDHWH GIINTQVDEF SNGFKTAAKN IPEMCIVLLV TLVQLGCLYA
LPYFVLRALG QPADLLTCLA SGSMLELLTS AIPLPGGTGG AEGGFAFLFG HMFGEKIAAG
FVLWRAIEYL LPTLVASMLL GLRSHDHEPI YHKWNRFRQR FSAFVNGEKP AAASTLPRPD
TSGIQIKVKR KK