Gene Apar_0800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0800 
Symbol 
ID8413665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp881475 
End bp882566 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content46% 
IMG OID645022382 
ProductTIM-barrel protein, nifR3 family 
Protein accessionYP_003179820 
Protein GI257784603 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.873611 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.99103 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGTCGC TTCAAACCAT CATTTCAACC TGTGCATCTG CAGCAGGTCT TTCTTCGCAT 
ATTGTTCCTT TTGCAGCAGG TAACTCACTC AAGGAGCGCC TGTCCGCTAA TCCTGTGCTC
ATGGCTCCCA TGGCCGGCGT CAGTGATGGT GTATATCGCA CTATGGCTCG AGCAGGCGGA
GCAGGTCTTG CCTATTCAGA GATGGTTTCT GTTGCAGGAA TTCACTTTGG TGGCGAGAAA
ACCTGGGAGC TTGTAGAGCC TCTTTCTACA GAGCCTGATA TTGCCGTTCA ACTTTTTGGT
TCAAAACCGG AGCAGTTTAA AGAAGCTGCA GCACAGATTA GTGAGCGCCT AGGAAATAAA
CTTGCCCTTA TTGATATTAA TATGGCCTGT CCTGTTCCTA AGGTTGTTAA AAAAGGGGAG
GGCTCAGCCC TTCTAGATAC TCCAGAACTT GCTGCAAATT TGGTAAAAGC TTGTCTTTCA
GAGGTTACTG TACCTGTTAC CGTCAAGATT AGACGAGGAA GACGGCAGGA TGAGGAAGTG
GCGCCTGAAT TTGCTCGTGC TATGGAAGCA GCTGGTGCAT CTGCCATTGC TGTTCATGGA
CGTTTTGCTA CACAGATGTA TCGAGGTCAA GCAGAGTGGG AGACGGTTAA TAGAGTTTGT
GATGCCGTAT CTATTCCTGT AATAGGTTCT GGTGACATGC TCTCTGCTGA AGATGTAGCT
CATGCTCTTA AAGAAACAGG TGTGACTGCT GTTATGATTG CTCGCGGAAC ATACGGAAAT
CCTTGGATCT TTAAGAACAC TCAAGCTTTA CTTGAGGGTA AAACTCCCTT TGTTCCATCA
GTTGATCAAA TGCTTGCGTC GTTCATGATG CACGTAAAGT TACTCGATGC TGCCCACATT
CATATTGCCC GTGCAAGAAG CCTCTCTACC TGGTATTTTA GAGGTATTCC AAATGCAGCT
TATTGGCGTG GCAAGTCAGT GAGATGTGTT ACTGCTCAAG ATTTTATTGA TCTTGCATTA
GAAATTAAAG AAACAGTTAC ACAACTTGTG GGCAAACAAG AAGCATCTGA ACACAAGCTG
TCTTTAAGCT AG
 
Protein sequence
MKSLQTIIST CASAAGLSSH IVPFAAGNSL KERLSANPVL MAPMAGVSDG VYRTMARAGG 
AGLAYSEMVS VAGIHFGGEK TWELVEPLST EPDIAVQLFG SKPEQFKEAA AQISERLGNK
LALIDINMAC PVPKVVKKGE GSALLDTPEL AANLVKACLS EVTVPVTVKI RRGRRQDEEV
APEFARAMEA AGASAIAVHG RFATQMYRGQ AEWETVNRVC DAVSIPVIGS GDMLSAEDVA
HALKETGVTA VMIARGTYGN PWIFKNTQAL LEGKTPFVPS VDQMLASFMM HVKLLDAAHI
HIARARSLST WYFRGIPNAA YWRGKSVRCV TAQDFIDLAL EIKETVTQLV GKQEASEHKL
SLS