Gene Apar_1254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1254 
Symbol 
ID8414133 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1403776 
End bp1404870 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content40% 
IMG OID645022846 
Producttranscriptional regulator, XRE family 
Protein accessionYP_003180270 
Protein GI257785053 
COG category[K] Transcription 
COG ID[COG1396] Predicted transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTACTGA GCGAGAAAAT CATGAGCCTC AGAAAGCGGA ATGGTTGGTC ACAAGAAGAA 
CTTGCACGAC AGCTTAATGT TTCAAGACAG TCCGTTTCCA AATGGGAGTC TATGGCTTCC
ATGCCAGATA TTCAAAAGAT AATGACTATG AGTGAGCTCT TTGGTGTCTC AACCGACTAT
CTGCTTAAAG ATGAAATGGA AGATCTCCCT GCAACTGCAG CATCTTTAGA CTCTGCAGAA
ACCTCAAGTG AGAGTGCTAC TCCCGAATCT TCTTCACAAA CTAATGAGAA AGACAACTAC
TCCTCCACAA AGATTAAAGT TTCTCTGGAT CTAGCAACGG AATATTTGGA TACTATTGAA
AAGACATCTC GTACAACAGC CTTTGCCGTT ATGCTCTTTA TCTTAGGTCC TGCGATTCTT
GGGTCGCTTG CCACCTACGG AAAAACTCAA ACAGATTTTA ATCTCGCGAT TATCAATTCA
GGTGCACTCA ATACATCCAA CCTTCTTAAC ATCATAGGTG TTTCTATTAT GATGCTCTGC
ATTGCAGCTG GTGTAGGCCT TCTGATTTTA CAGAACGTAA AACTTTCACC ATTCAAAGAA
CTTAAAGAAA ATACCCTTGA TCTGCAATAT GGTGTTGAGG CAGCTGTTAA GCGACGCGCT
GAATCTACAA AGTCCCTGCG CTCTTTTCAG CAAGCTGCTG GAGTTTGTCT TACTATTCTG
AGCTCTATCC CTTTTGTTAT CGCATCGTAT TTTGAAACTG GTCTTTACTT CTCAATAGGA
TTTTTTATCG CCATGTTCAT GGTTGCTTTT GGGGTCTTTT TGCTCGTTAA TTCTGGTATT
GTCAAAAGTA GCTACAATGT TCTTCTCCAA GAAGATGACT ATTCAAATAG CGAGAAAATC
TCCAAAAATA AACAAAAGTC TATCTACGCC CAGTATAAGC AATATACGCA AGCTTACTAT
GTAGTTATCA CACTCATGTA TCTTGGCTAT AGCTTTATTA CCTATGACTG GGGTAGGAGC
TGGATCATTT GGCCTCTCTC TGCACTTCTC TACCACGCTG TCATTAGTGT TTTAGGTGCA
TTTAAAAAGA AATAA
 
Protein sequence
MLLSEKIMSL RKRNGWSQEE LARQLNVSRQ SVSKWESMAS MPDIQKIMTM SELFGVSTDY 
LLKDEMEDLP ATAASLDSAE TSSESATPES SSQTNEKDNY SSTKIKVSLD LATEYLDTIE
KTSRTTAFAV MLFILGPAIL GSLATYGKTQ TDFNLAIINS GALNTSNLLN IIGVSIMMLC
IAAGVGLLIL QNVKLSPFKE LKENTLDLQY GVEAAVKRRA ESTKSLRSFQ QAAGVCLTIL
SSIPFVIASY FETGLYFSIG FFIAMFMVAF GVFLLVNSGI VKSSYNVLLQ EDDYSNSEKI
SKNKQKSIYA QYKQYTQAYY VVITLMYLGY SFITYDWGRS WIIWPLSALL YHAVISVLGA
FKKK