Gene Apar_0456 details

Gene Information

Locus tag: Apar_0456
Symbol:
ID: 8413305
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 524276
End bp: 525658
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 44%
IMG OID: 645022024
Product: protein of unknown function UPF0118
Protein accession: YP_003179478
Protein GI: 257784261
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
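
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither assumption is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 524276
end_bp = 525658
gene_length_bp = 1383
protein_length_aa = 460


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid count, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

print(computed_length == gene_length_bp)       # True
print(computed_protein == protein_length_aa)   # True
```

Under these assumptions the listed gene length (1383 bp) and protein length (460 aa) are consistent with the coordinates.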


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.204189
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCAAA AGCATCAAAA CTTTTCCGAT TCGATTCAGA GTTGGAAACA GCGTGGCCTG 
ATGGTATGGA CAGCCATTGG CTTTGCTGCG TTGTTTGCAC TTGCACTGTA TGTCCTTGGT
ATTTTGGGGC AGGCCGTTGA GTTGTTGGCT ATTGGCGCTA TTGTTGCGTT TGTGTGCAGT
CCTGTAACTA ACTGGCTTGA AGATAGGGGA ATACCTCGCG GTATTTCTGC TTTTGCAGCA
CTTATTCTTA CACTTATTGT GTTTGTAGGA TTTTTGATTT TGATTGCTCA ACCATTGGTG
CTTGAGCTTA CCACGCTGCT TAAGAATGCT CCTTCGTATG CAAGCCAGAT AGGAGCAATG
GCTAGGGAAT TTTGGCAGAA CTTTGACTCT CAGAGTAACC CAGCTGTTAG ACAGACGGTA
GAGCTTGTAA TTGAGCGGGC ATCGAGCATT GGAATATCAG TTGCTTCTGG CATTTTAAGT
TGGCTTTCAA CATCTGCTTT AGGCAATATT TCATCTATGG CAAACCAGCT TATGGTCTTT
TTCTTGGGTC TAGTGCTTGC CTATTGGCTT GCCAAAGATT ATCCCGTTAT TGTTCGTGAG
CTGGCTATTA TTGCAGGTCC TCAAAAAGAG GATGAGTTCA GACTTATTCT TGCAATCTTA
AGCAGATCTA CCAGTGGATA TATGCGTGGA ACTATCATTA CCTCTGCAGT TAACGGCATT
CTTGTGTACT TTGGTTGCCT TATTTTAGGT AACCCTTATG CTGCCCTCAT TGGTATGGTC
ACAGGAATCT TCCACATTAT TCCTGTGGTT GGACCGGTTT TCTCGGCAGG CATTGCTCTG
ATTCTGAGTA TTTTGGTAGA CCCCATCATG ACCGTGTGGA CCATCGTTAT CTTGATGGTT
GCTCAAAACG TTGTGGATAA TGTGCTTTCA CCTTTGGTTA TGGCAACTAG CGTCAAAGTC
CATCCGGGTC TTTCACTTAT AGGCATTGTT ATTGGTAGCG CTCTTGGCGG AGTAGTCGGA
ACCATTCTTG CAATTCCACT GACTGCAGCA CTTAGAGGTA TTTTTGTGTA CTTCTTTGAG
AAGTACTCGG GCAGACAGAT TGTCTCACCA AATGGTGCGC TCTTTAATTC CACGCAGTAT
GTGGATGAGA AAGGCGCTAT TTTGCCAGAG TATGATGCAC TGGACGATCC AAAGTTTTTT
GAGGAGTCAC GTCTTGTTGA TCAAGACACT ACGGCTCATA TTCGTAGTAA GTCTTCAATT
CCTGCGCCAA AGATTCTTGG GCATGATTTT TCTCAGTTAC TTTTTAGGAA TACTCAAGAA
GTTATTAAAG AACCAGATAA ACCATCGTCA GATACGGTAG ACTCAGACAG TACAAAAGAG
TAG
 
Protein sequence
MDQKHQNFSD SIQSWKQRGL MVWTAIGFAA LFALALYVLG ILGQAVELLA IGAIVAFVCS 
PVTNWLEDRG IPRGISAFAA LILTLIVFVG FLILIAQPLV LELTTLLKNA PSYASQIGAM
AREFWQNFDS QSNPAVRQTV ELVIERASSI GISVASGILS WLSTSALGNI SSMANQLMVF
FLGLVLAYWL AKDYPVIVRE LAIIAGPQKE DEFRLILAIL SRSTSGYMRG TIITSAVNGI
LVYFGCLILG NPYAALIGMV TGIFHIIPVV GPVFSAGIAL ILSILVDPIM TVWTIVILMV
AQNVVDNVLS PLVMATSVKV HPGLSLIGIV IGSALGGVVG TILAIPLTAA LRGIFVYFFE
KYSGRQIVSP NGALFNSTQY VDEKGAILPE YDALDDPKFF EESRLVDQDT TAHIRSKSSI
PAPKILGHDF SQLLFRNTQE VIKEPDKPSS DTVDSDSTKE
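
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the database's own method: it builds the codon table from the usual NCBI base ordering (TCAG), which gives the same codon-to-amino-acid assignments for translation table 11 as for the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). The `translate` helper and the variable names are illustrative.

```python
# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored, so the space-separated 10-base groups used in
    the record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
first_codons = "ATGGATCAAA AGCATCAAAA CTTTTCCGAT"
last_codons = "ACAAAAGAG TAG"

print(translate(first_codons))  # MDQKHQNFSD
print(translate(last_codons))   # TKE
```

The two results match the first ten and last three residues of the protein sequence listed above; passing the full gene sequence to `translate` would check the remaining residues the same way.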