Gene Apar_0252 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0252 
Symbol 
ID8413100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp293046 
End bp294104 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content41% 
IMG OID645021819 
Productselenide, water dikinase 
Protein accessionYP_003179274 
Protein GI257784057 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0709] Selenophosphate synthase 
TIGRFAM ID[TIGR00476] selenium donor protein 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0491527 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAATG ACGTAAAACT CACAAAACTT GCTGATTGTG CTGGTTGTGG TGCTAAGGTT 
GGTGCCGGTG AGTTGGCAAA GCTTCTTTCA GATATTAAAG TTCATGAGGA TCCTAATTTA
TTAGTTGGTT TTGATAAAGC GGATGATGCA GCTGTTTACA AAGTGACTGA TGAAATAGCG
CTTGTCGAAA CAATCGATTT CTTTCCTCCA ATTGCTGATG ACCCATATAC GTATGGCGCT
ATTGCGGCTA CCAATGCTTT ATCGGATGTA TATGCTATGG GTGGAGAGCC AAAGGTTGCT
CTTAATGTTA TGGCTGTACC CGAAGACATG TCCTCGCATG TAGTGTATGA GATTTTACGT
GGTGGTTACG ATAAAGTATA TGAGGCTGGT GCCAATATTG TTGGCGGTCA TAGCATCTAT
GACAATGAAC CAAAATATGG TCTTGCTGTT TCAGGGTTTG TTAATCCAAA AAAGATGTAT
ACCAATTCTG GTGCACGTGC AGATGACGTT TTAATTCTTA CAAAGGCTCT TGGTGTAGGT
GTTCTGACAA CGGCTGCTAA AGCAGATATG CTCTCTATAG AAGAGATGAG TGTTGCTGAG
GCGTCTATGA TGACTCTTAA TCGGTACGCT CGAGACATTA TGGTGAACTA TGACGTGCAT
GCTGTTACGG ATGTAACGGG ATTTTCACTT ATGGGACATT TGCTTGAAAT GTGTCAGGGT
TCGGGGCTTT TGGCAAAGAT TACCGTAAAT AATATAAAAT TTTTATCTAC GAGAGTATTT
GAATTAGCAC GTTTAGGAAT TTTACCAGCT GGTATGTATC GTAATCGTCA CTATGCAGAG
AAGTACGTGC AAGCAGAAGG GGTTTCTCGT GAGATGAGTG ACGTGCTCTT CTGTCCTGAA
ACTTCTGGTG GATTATTGAT AAGCGTAGCC CATGATGATG CTCAGCATTT ACTTGCGGCT
CTTCAAGCAA ATGAACATAC TGCTAATTCT TGTGTTGTGG GATATATGGA AAAGCAAGAC
ATAAGAAATA AGGATGCAAC TTATATTGTG CTTCAGTAA
 
Protein sequence
MSNDVKLTKL ADCAGCGAKV GAGELAKLLS DIKVHEDPNL LVGFDKADDA AVYKVTDEIA 
LVETIDFFPP IADDPYTYGA IAATNALSDV YAMGGEPKVA LNVMAVPEDM SSHVVYEILR
GGYDKVYEAG ANIVGGHSIY DNEPKYGLAV SGFVNPKKMY TNSGARADDV LILTKALGVG
VLTTAAKADM LSIEEMSVAE ASMMTLNRYA RDIMVNYDVH AVTDVTGFSL MGHLLEMCQG
SGLLAKITVN NIKFLSTRVF ELARLGILPA GMYRNRHYAE KYVQAEGVSR EMSDVLFCPE
TSGGLLISVA HDDAQHLLAA LQANEHTANS CVVGYMEKQD IRNKDATYIV LQ