Gene Apar_0082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0082 
Symbol 
ID8412925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp91647 
End bp93119 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content51% 
IMG OID645021649 
Producthypothetical protein 
Protein accessionYP_003179109 
Protein GI257783892 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTGCC CCAATTGCGG AGTTGAAATT GATGAGCACA CAAATTTCTG TCCCAACTGT 
GGACAGAAGC TTTCAGACGC CGCGGATGTT GCGGATGTTG CAGACACTGC AGATGTTGCA
GACGCCGTGG ATGTTTCAGA CGCCGTGGAT GTTGCAGACA CTGCGGATGT TGCAGACGCC
GCGAATGTCG AGAAAACCGA TGCGCCTGCT GAGTCTACAG GACCTACACC AGAAACCGCA
CCCGACAACT CCACCACAGC CACCGAGGTT ATCGCACCAG CAGAGTCTGA TACGCCAACT
GAAACGGTAA CTGACCCAGA CGCTACCATC CTCGCAGAGC ATCCAGCTGA TGACGCAGCT
GATGACGCAA CCGATGAGGA ATCTACAACC TTCATTAACG ACCCTGACCG CATCAACGTC
TTTACCACCG ACGACCTTGA TTCAACTGCC ATTTCTCACT TTGACCCTTC ACAGTTCACC
GTCATTTCAA ACGGACCTAG CTCTGTTGAA CGCAAACAAA CCAAGGACGG TTCGGACACG
CTCCGCAAAG TTGCATTTGG CTTGCTGGCA GCCGTCTTAG TCGTAGGCGC ATTGGCCATT
GTATGGGTCA TCCACTCTCA TCAAGACGAT GCAGCCAGGC GCAACGCTGT CCAGCAGCGT
ATTGCCAGTT ATCAAGAGCA AATCAAAAGC GTTGACGTTA ATCCAACCAG CGACAGCAGT
CGTGTTGAAC TTCTCAACCA ATACGAGAAG CTCGATCAAA TTGAAGAGCA AATCACTGGT
GACCAGAAAA ATGGCCAGTT CCGCCTTCCA AACGGCACCG ATGACTCAAG TGTCAACGTC
CTAAACAGCT CAATCTCGGA TGGGCAGAAG AAGATTCGCG ACTGGTTTGA GGCCGACTAC
AAGCGCAGGC TGGCTGCAAA CAGCTTTAAC GATACCGATT CAGCAACCAC GCTTGACCTC
AAGTCCGTTG CTAATAGGCT TGTTGAACTG CAGGCTCTTC TCGGTGACAT AGAGAACGAG
AAGGAAATCT GGGGCAATGA TACCGGCACT AATTCAACAT ATGACTCATA CCACTCTCGT
GTTTCTGACC AGATAAAGAA AGGTGAGAGT CTAAAATCTG GCGTGACTGA GAAAAACAAG
AACGAGCAAG AGAAGAAGGA TAAGGACAAA AAGTCCGAGG AGGACCGCAA GAAAGCCGAA
AAGTGGGTTG GTACCTATAG CGGAACCGGC ACCGACGGCA AATCTATGGA GGTTGTCATC
CAGAAAGATG GCACGGTCCT TTGGCGTATT GGTGGTCAGC CTGAGGTTCG CGGAACCTGG
ACCGGCGACG AGAGCAAGCT TGAGCTTAAC TTCAATGGTC AGGTTAGTGG CAGAAGCGAG
CCTTTTACCC TCTCCTCTAC CGACGGCGGC AAAACCGTTT CTATCAGCTC TCAAAGTAGC
ACGTGGAACA CAGATACGCT TTCTAGAAAG TAA
 
Protein sequence
MNCPNCGVEI DEHTNFCPNC GQKLSDAADV ADVADTADVA DAVDVSDAVD VADTADVADA 
ANVEKTDAPA ESTGPTPETA PDNSTTATEV IAPAESDTPT ETVTDPDATI LAEHPADDAA
DDATDEESTT FINDPDRINV FTTDDLDSTA ISHFDPSQFT VISNGPSSVE RKQTKDGSDT
LRKVAFGLLA AVLVVGALAI VWVIHSHQDD AARRNAVQQR IASYQEQIKS VDVNPTSDSS
RVELLNQYEK LDQIEEQITG DQKNGQFRLP NGTDDSSVNV LNSSISDGQK KIRDWFEADY
KRRLAANSFN DTDSATTLDL KSVANRLVEL QALLGDIENE KEIWGNDTGT NSTYDSYHSR
VSDQIKKGES LKSGVTEKNK NEQEKKDKDK KSEEDRKKAE KWVGTYSGTG TDGKSMEVVI
QKDGTVLWRI GGQPEVRGTW TGDESKLELN FNGQVSGRSE PFTLSSTDGG KTVSISSQSS
TWNTDTLSRK