Gene Apar_1012 details


Gene Information

Locus tag: Apar_1012
Symbol:
ID: 8413884
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1144080
End bp: 1145129
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 47%
IMG OID: 645022601
Product: hypothetical protein
Protein accession: YP_003180032
Protein GI: 257784815
COG category:
COG ID:
TIGRFAM ID:
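
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the record above; function names are illustrative.
START_BP = 1144080
END_BP = 1145129


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


print(gene_length(START_BP, END_BP))                  # 1050
print(protein_length(gene_length(START_BP, END_BP)))  # 349
```

Both results match the Gene Length (1050 bp) and Protein Length (349 aa) fields in the record.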


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.702386
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCTGA ATGATTCATC TGATAGAAGA CAGGACTTCT ACGGAGACTC TGCAATGCAT 
CGCCGCCCTC CACAAAATAA CCGTCCTCCT CGCGATGATG GTTTTACTGA TAGGCAAGCT
AGACGTCGTA AAAGAAATAC TCTTAGTTCA CAGGCCATTC CTTTTAAAAG ACAGTCTTTA
CTTCATCGCA TTGGTCCTCG TACGCGAACC TATATTGTTA TTGGACTGGC CGTTATTGCA
ATACTTTTAT TGATCTTTAT TATTTCGAGC TGTGTTCGTA GTTGTGCAAA AGAATCAACT
CCAGGCGCGG AAGTCAACTC CGTTGATTCT CGTGTTGCTG TGGGAACTTC AGAAGAGCTA
ACCAAAGCGC TTGCAGCCAA ACTTGATCAG AACAAAAATC TTGCGTGGAT TGCAGAACAC
GCTGACAAAT ACAGCGATAA AAGCCTCATT GAGCTTGCAC TCGCCCATCC TGAAGCAATT
GATTTTGTTG CAAACTACCC TGATTCTGAC GGCAAAGCAA AAACCTATGA TGATTCGCTC
ACCAAAGGTA CGGCTCCCCA GTTCTATACC TGGGACTCCA GATGGGGCGG CGTCTCATAT
GCCGGCTCTG TAATTGCTAC CAAGGGCTCT GGTCCAGCTG CTCTCTCTAT GGCATACATG
GGACTTACAG GGAAGAACAA CTGGACTCCA GCAGATATTG CAGGGGCTAT TGAAACTGCC
AAAGCTTCTG ATACCGACTC TGGAATGAAC AAATCATTTC TCGAGAAGAA CTTAGCTGAT
CTTGGTCTTA CCGCTGATAG CTATAACATT TCTGCTGGTA ATATTACCGC GCTTCTAGAT
GCTGAAACCT TCCTACTTGT TGAGGTTAAA GACAACAAGC TGAGCTCAGA TGGTGCTCAC
TGGATACTGG TGACCAGCAA AAATGACGAC GGCACCGTAA ATGTTCACGA CCCTCTATCT
CCTGAGGTGA GCTCACGTCC ATGGGCGGCA GAGACTATTG CCAGCGCTGC AAATGCCCTA
TACACCGTGT CAGCAAAGAC TACTGAATAG
 
Protein sequence
MPLNDSSDRR QDFYGDSAMH RRPPQNNRPP RDDGFTDRQA RRRKRNTLSS QAIPFKRQSL 
LHRIGPRTRT YIVIGLAVIA ILLLIFIISS CVRSCAKEST PGAEVNSVDS RVAVGTSEEL
TKALAAKLDQ NKNLAWIAEH ADKYSDKSLI ELALAHPEAI DFVANYPDSD GKAKTYDDSL
TKGTAPQFYT WDSRWGGVSY AGSVIATKGS GPAALSMAYM GLTGKNNWTP ADIAGAIETA
KASDTDSGMN KSFLEKNLAD LGLTADSYNI SAGNITALLD AETFLLVEVK DNKLSSDGAH
WILVTSKNDD GTVNVHDPLS PEVSSRPWAA ETIASAANAL YTVSAKTTE
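
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the DNA. It assumes the codon-to-amino-acid assignments of translation table 11 (which are the same as those of the standard code) and uses only the Python standard library; all function names are hypothetical helpers, not part of any database API.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. The amino-acid assignments
# of translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError if a codon contains characters other than A, C, G, T.
    """
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGCCCCTGA ATGATTCATC TGATAGAAGA"))  # MPLNDSSDRR
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field in the record.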