Gene Apar_0951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0951 
Symbol 
ID8413822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1070558 
End bp1071574 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content46% 
IMG OID645022539 
ProductABC-type nitrate/sulfonate/bicarbonate transport systems periplasmic components-like protein 
Protein accessionYP_003179971 
Protein GI257784754 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00952521 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAGAC GTTCGTTCCT TAAGCTTGCA AGTCTTATTC CAGCTACAGC ATTGTTTGGC 
TGCAAGGGAA CAAGTCAAGA GAAGTCTCAA GAAACTGCAA AAGATGAGGC TAAGAAATCT
GACCCCGTTG CGGTAAAAGT TGCAACACTT AAAGGACCTA CCGCCATGGG CTTGGTTAAG
TTCATGAGTG AGGTTGAAGC AAAAAACATT ACCGACAACA ATTATTCATT TGAGATTTTA
GATGCCCCTG ATCAGGTAGT TGCTAAGGTA GCTCAGGGTG ATGTCGACGT TGCGTCTATT
CCTGCAAACC TTGCCGCTAC GTTATTCAAC AAGACCAAAG GTGCCTACAA GGTAGCTTGC
CTCAACGTAC TGAATGTTCT CTACATTGTT GAGACGGGAA GCGCTATTTC TAAGATTGCT
GACCTTAAGG GAAAGACGCT CTATGCCTCT GGTAAGGGTG CTGTTCCAGA GTACACACTG
TCCTACTTGT TGAGCAAAAA TGGTATGACG CTTGGTGAAG ATGTCCAGGT TGAGTGGAAG
AGCGAGCATA CCGAGTGCGT TGCAGCTCTA GCACAAGATC CAGAGGGAAT CGCATTGCTT
CCACAGCCTT TTGTTACCGT GGCACAAACC AAGAACAGTC AGATTCGCAT AGCAATTGAC
CTTGGTGCCG AGTGGGAGAC AGTTAATCCT CAGAGTAAGT TGATTGCAGG CGTAACCATT
ATTTCTTCAA AGCTTATCTC GGATTCTCCA GATGCTGTGA CTGCTCTGCT TTCTCACTAC
AAAGACTCTG TTGAATTTGC TGTTGATCAT CCAGATGATG CTGCTACACT TGTGGGCAAA
TACGGCATTG TTCCAGAGCC TATTGCCAAG GTTGCACTGC CTAAGTGTAA TATTACGTAT
ATTGATGGCG CAGATATGAA GACTGCACTT TCAAGTTATT TAGGCATTCT GGCCGAGGCT
AATCCTCAGT CAGTAGGCGG ACAGGTTCCC GGAGACGATT TCTACTTTGG CGCATAA
 
Protein sequence
MDRRSFLKLA SLIPATALFG CKGTSQEKSQ ETAKDEAKKS DPVAVKVATL KGPTAMGLVK 
FMSEVEAKNI TDNNYSFEIL DAPDQVVAKV AQGDVDVASI PANLAATLFN KTKGAYKVAC
LNVLNVLYIV ETGSAISKIA DLKGKTLYAS GKGAVPEYTL SYLLSKNGMT LGEDVQVEWK
SEHTECVAAL AQDPEGIALL PQPFVTVAQT KNSQIRIAID LGAEWETVNP QSKLIAGVTI
ISSKLISDSP DAVTALLSHY KDSVEFAVDH PDDAATLVGK YGIVPEPIAK VALPKCNITY
IDGADMKTAL SSYLGILAEA NPQSVGGQVP GDDFYFGA