Gene Apar_1234 details


Gene Information

Locus tag: Apar_1234
Symbol:
ID: 8414113
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1385749
End bp: 1387068
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 50%
IMG OID: 645022827
Product: hypothetical protein
Protein accession: YP_003180251
Protein GI: 257785034
COG category:
COG ID:
TIGRFAM ID:
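The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1385749
END_BP = 1387068
GENE_LENGTH_BP = 1320
PROTEIN_LENGTH_AA = 439


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1387068 - 1385749 + 1 = 1320 bp, and 1320 / 3 - 1 = 439 aa, which agrees with the reported fields.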


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000907369
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGTACA CAAGAAGAGA TGTAGTAGCC GGTCTTCCAG TTTTGATGGC ATTCGGACTT 
GCTGGTTGCA AAACCGCAAA AGCAACCGCT AATAACACAA CTGAAGACAC CAAGAAGGCT
TCAGAAGCAC TTCGGATTAC CGCCGCTCAG ACCCTGGATG GCTATGCAGT AAACAGCACT
ACTGCTGATG AGAATACCAT CCTGGTCAAT ACTACCGACG ATGTAACCAT TACCAATTCC
ACGGTTACTA AGACAGGAGA TTCCGACGGC GGAGATAACT GTAACTTCTA CGGTCAGAAC
GCTGCAGTAC TTGTTGAGGG AGGTTCAACC ACTACGCTGA CTAACCTCAC TGTTACTTCA
GATGCAAAGG GTGCCAACGG CATTTTTAGC TACGGAGGCA ACGGCGGTCA GAACGGCGGT
GACGGTGATG GCACCAAGGT TATTATCAAG GACACCACCA TCACTACAAC TGGCGACGGC
GCGGGTGGCA CCATGACAAC CGGTGGCGGC ACCACCAATG CCTACAACCT CACGGTTACC
ACAAACGGTC AGTCTTCTGC AGCTATTCGT ACCGATAGGG GCGGCGGAAC AGTTTACGTA
GACGGTGGTA CCTATACCTC CAATGGTCTA GGTTCGCCAG CCATCTACTC CACGGCAGAG
ATCCACGTTG CTAACGCCAC ACTTGTTTCT AACCTTTCCG AGGGCGTTTG TATTGAGGGC
TTGAACTCCA TTGAGCTTAC CGATTGCGAC CTTACGGCAA ACAACACCAA GTGCAATGGC
AACGCAACCT TCATGGACAC CATCATGATT TACCAGTCCA TGTCCGGAGA TGCAGCAACA
GGTAATTCCA CCTTTGCTAT GACTGGTGGT TCCCTCACCA GCAAGAACGG TCACATGTTC
CACGTTACTA ACACTAACGC TGACATTGAG CTCAATGGCG TCAAGTTAAC TAACGAAGAC
GCTGCTAACA TTCTTATCTC TGTCTGTGAT GACGGTTGGA ATGGCGGTAA TAATAAGGCA
ACCTTTAACG CTAAAGCGCA GGATCTGGTG GGAGCGGTGC TTGTTGGCAA CAACTCCACA
CTTGCTCTGA ACCTTACCGA AGGAACCACG TTTGAGGGTT ACGTTAACGG CAACATCGTC
AACGCCACTA ACCAGACTGT TTCCACTGAA GTTGGTACTG TTGCGGTAAC ACTGGATAAC
AACAGTACTT GGACTTTGAC AGCAGATAGC TATGTCACCG AGTTCAATGG TACTGCAGCA
AACGTTATTG CTAACGGTCA CACACTGTAT GTAAAAGGTA CGGCACTCAC GGGAACCTGA
 
Protein sequence
MQYTRRDVVA GLPVLMAFGL AGCKTAKATA NNTTEDTKKA SEALRITAAQ TLDGYAVNST 
TADENTILVN TTDDVTITNS TVTKTGDSDG GDNCNFYGQN AAVLVEGGST TTLTNLTVTS
DAKGANGIFS YGGNGGQNGG DGDGTKVIIK DTTITTTGDG AGGTMTTGGG TTNAYNLTVT
TNGQSSAAIR TDRGGGTVYV DGGTYTSNGL GSPAIYSTAE IHVANATLVS NLSEGVCIEG
LNSIELTDCD LTANNTKCNG NATFMDTIMI YQSMSGDAAT GNSTFAMTGG SLTSKNGHMF
HVTNTNADIE LNGVKLTNED AANILISVCD DGWNGGNNKA TFNAKAQDLV GAVLVGNNST
LALNLTEGTT FEGYVNGNIV NATNQTVSTE VGTVAVTLDN NSTWTLTADS YVTEFNGTAA
NVIANGHTLY VKGTALTGT
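
The gene sequence in this record is printed in space-separated 10-base blocks, so it has to be cleaned up before it can be compared with the reported GC content. The following is a minimal sketch of that calculation, assuming the sequence has been pasted into a string with its spaces and line breaks intact. The function name `gc_content` is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring all whitespace."""
    # Drop the spaces and newlines used to format the 10-base blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Toy input in the same block layout as the record (not the real gene).
example = "ATGCATGCGG CCATATATAT\nGGGGCCCCAA"
print(f"{gc_content(example):.2f}")
```

Passing the full gene sequence to this function and multiplying by 100 gives a percentage that can be compared with the GC content field.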