Gene Apar_1255 details


Gene Information

Locus tag: Apar_1255
Symbol:
ID: 8414134
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1404973
End bp: 1406067
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 40%
IMG OID: 645022847
Product: protein of unknown function DUF871
Protein accession: YP_003180271
Protein GI: 257785054
COG category: [S] Function unknown
COG ID: [COG3589] Uncharacterized conserved protein
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGCC TTGGAATTTC AATTTATCCT GAAAAGAGTA CAAAAGAGGA TATTCTTGCA 
TATTTAGATC GAGCAGCTAG TGCTGGATTT TCTCGCATTT TTAGTTGTCT TCTTTCGGTT
CAGGACACTA AAGAAGCTAT TGTGCAGAAG TTTTCTGATA TCAATGCACA TGCGCATCGT
CTTGGCTTTG AGATTATCGT TGATGTCAAT CCAAGAGTTT TTAAAGAGCT TGGAATTTCA
TATGACAATC TTTCTTTCTT TAAAGAGATT GGTGCAGACG GCTTTAGGCT TGATTTAGGG
TTTACTGGTA AAGAAGAGTC TTTTATGACG TTTAATCCAG AAGGATTATT TGTTGAGCTT
AACATGAGCA ATGACGTGGA CTATCTTGAC ACCATAATGA AATACCAACC AAATAAAAAT
AAGCTTATTG GTTGTCATAA CTTTTATCCT CATGGCTATT CTGGATTGGG CCTTGAATTT
TTTAACCGTT GCACTGAGCG TTTCACAAAG TATGGGATTA ATACTGCAGC ATTTGTGACC
AGTCAGGCTC AAAATTCGTT TGGCCCATGG CCGGTGACCG AAGGTCTTCC TACACTAGAG
ATGCATCGAC ACATGCCTAT GCACCTGCAG GTGGAACACT ATATTTCAAT GGGAACTATT
GACGACGTCA TTATTTCTAA CTGTTTTGCT ACAGAATCTG AGTTTGAAAA AATAAAGTCC
TTACCGCATG ACCTGGTGAG CTTTGGTGTG AAGCTTGTGG ATACCATTCC ATCTGTCGAG
AAGAGCATTG TACTTGAGGA ACTTCACTGT AATAGGGGAG ATGTTTCTGA CAACCTAATT
CGTTCATCTC AGAGTCGTGT GAAGTATAAG GAACATACCT TTGACGTATT TAATGCACCT
TCTACTATTC GTCGAGGGGA CGTTATTATT GAAAGCAGCC TCTATGGCCA TTACGCAGGA
GAGATGCAAA TTGCTCGAAC GGATATGATT AATACTGGTA AAACAAATGT TGTTGGTCAT
ATTCCAGACG AAGAGCACTT TTTGATTGAT ACCTTACAAC CATGGCAGAA ATTCCGTCTC
CACGAAGTGT TGTGA
 
Protein sequence
MKRLGISIYP EKSTKEDILA YLDRAASAGF SRIFSCLLSV QDTKEAIVQK FSDINAHAHR 
LGFEIIVDVN PRVFKELGIS YDNLSFFKEI GADGFRLDLG FTGKEESFMT FNPEGLFVEL
NMSNDVDYLD TIMKYQPNKN KLIGCHNFYP HGYSGLGLEF FNRCTERFTK YGINTAAFVT
SQAQNSFGPW PVTEGLPTLE MHRHMPMHLQ VEHYISMGTI DDVIISNCFA TESEFEKIKS
LPHDLVSFGV KLVDTIPSVE KSIVLEELHC NRGDVSDNLI RSSQSRVKYK EHTFDVFNAP
STIRRGDVII ESSLYGHYAG EMQIARTDMI NTGKTNVVGH IPDEEHFLID TLQPWQKFRL
HEVL
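
The coordinates, lengths and sequences in this record can be cross-checked with a few lines of Python. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon (the gene sequence above ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons minus one stop codon (assumes an unspliced CDS)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(text: str) -> str:
    """Remove the 10-letter grouping spaces and line breaks used in the record."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # Values copied from the Gene Information fields of this record.
    print(gene_length(1404973, 1406067))   # 1095
    print(expected_protein_length(1095))   # 364
```

Both results agree with the Gene Length and Protein Length fields. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.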