Gene Apre_0549 details

Gene Information

Locus tag: Apre_0549
Symbol:
ID: 8397326
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 633847
End bp: 636150
Gene Length: 2304 bp
Protein Length: 767 aa
Translation table: 11
GC content: 38%
IMG OID: 644994907
Product: hypothetical protein
Protein accession: YP_003152316
Protein GI: 257066060
COG category: [S] Function unknown
COG ID: [COG1511] Predicted membrane protein
TIGRFAM ID: [TIGR03057] X-X-X-Leu-X-X-Gly heptad repeats
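
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both are assumptions that happen to reproduce the listed values, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 633847
end_bp = 636150

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# which is not translated into an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 2304
print(protein_length_aa)  # 767
```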


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAATA AACTTGTAAA AAGCTTATCT GTAGCGCTCG CAGGATCCAT GCTTGTGACA 
AATGTAGTTT TTGCAGATGG GACAGTTAAG AAAAACGAAA CCTTGTATGT AACACAAGAA
GAAAATGAAA TAAAAGATAA GACAGCTTCA ATCTGGATCA ACTCAGATGG GAATGTAAAG
GTTAAGGATA AGTCTAATCT TAAAGATATC AAAAACCTAA AGACAGATGA AAAAATTGAT
CCAAAAGATG GTTATATAAA TTGGAATGAA GATAGCAAGG ATGTCTACTA TCAAGGACAA
GCAGAAGAAG ACCTTCCTGT AGATATTTCT GTAAAATATT TTTTAGATGG TAAGGAGACT
AAGGTTAAAG ACCTAGAAGG AAAATCTGGT CACCTAAAAA TCGAAATATC AGCTAAGAAC
AAAAGAAGCG GAATAGCTAC AGTAAATGGT AAAGAACAAA AGGTATTTTC ACCATATCTA
GTACTTGCTG AAATTAACTT CGATGAAGAT AAGGTTACAA ATATAAAAAC AGACGACGGC
AAGGTCGTAA AAGATGGTAA AAACGAAGTT GTCGCAGGAG TTCTCGCACC AGGACTTAGA
CAAAACTTTA AAGAAATCCT TGAAGATGAT AAACTTGATA AGTTCAAGGA TAAAATCGAG
ATGGAAATGG ATGTAAAAGA CTACAAGCCT ACTGAGGTCT ATGCAGTAAT TACAAACGAA
ATCTTCCAAG ATTCCGCTAA TATCTCATCC TTAGACGATT TAAACAAGGG AATTGATGAG
CTTGAATCAA ATGCAGCTAA GCTTGTTGAT GGATCTAATC AATTGGCAGA TGGGGGCAAT
AGATTAAATG ATGGTATAGG TCAGTTAAGT AATGGAGCTA GCGAGCTTGC AGCAGGTTCT
GGCAAAATCG TATCATCATT CGATCAAATG GCTAATGCCT TTGCGGACCT ACCTAACCAA
GTCGGAACAA TGACTTCAGC AATTGACATG CTAAATAATG GAGGAAGCAG TCTAGATCAA
GGTATCAACC AATACACTGC CGGAGTTAAT AAGATTAATG AAAATATGCC AAAGCTTACT
GAAGCTAGTA GACAACTAGA AGCTGGAGCA AGTGACCTAG ATAATGGTCT TCTAAAACTT
GATAATGCAA CATCAACACT AAAAGAAAAA ATGGGAATGA CTGATGGCAA AGAAGACTTG
GGCAAATTTA GCGAATCCAT GAATGAACTA AGGACAGGAC TTGATAGCCT ATCAACAGGA
ATTGCTCCAT TAAGTCAAGG AGTAAGTGAG CTTAGCCAAG GATTAAGTAA ACTTGAAGAC
TCAAGTCGAG AGCTAAGTGG TGGAGTTAAT AATCTCCTAT CTTCCACCCA AAATATGCCT
AGTCTTGATC AAAATTCAGC CGATCTTACA GCTCAAGCTC AAGCAATTGA TGCAGTGATA
GCTAACTTAG AAGAAAATAA CGAAGACGGA AATCTAACAG GTCAAATCGA AAGTCTAAGA
GCTGTTAGAG ACAATCTCTA CAAAGAAAGT GAAAGCGTGG CTACAGCAGG ATATGCTTAT
AGCCAAATCT CAGGATCTCT TGAGCTTCTT TCCCAAGGAG CAAGCGAGCT TAGCGGAGGA
ATCAGCGGGG CTTATAAGGG AGCAAGTGAT ATCGATGAGA AATTAAAGGC TTCATCAGAT
AAGATGATTA CGGCTTCAAA GAGCCTTGCT GCAGGAACTG AGAAAATAAG CCAGGGATTC
GATAAGAGCA ATCTAGCCAA ATTACAAGAA TCTATAGTTA TGCTAGATGA AGCTACAGGA
AAGTTAAAAC AAGGATCAGC TAAACTTAAG GAAGGAACTA GCCAAAATAG GGCAGGTGTT
GAAAACTTAG CAGCTGCTGT ATCTGAGCTT GACGGTAACT CTCAAAGCCT AAGAGAAGGT
AGTGCAGAAC TTTCAGGAGG ATTGGCACAA TTTGCCGAAA GAAGCAAGGC CCTATCATCC
TTAGGAAACA TCAATGAAGA AGCTATAAAC CCTATGGCTG CAGGGCTAAA CCAACTAAAT
GATGGAATAA TGAAGCTTGA TTCATCTACC AAAGAATTAA AAGACGGAAG TGATCAATAT
ATGGCTTCAT TTGAAGAGTT TAGATCTGGC CTAAGCGAAT ACAAGGCAAA GGGAATCGAT
GAAATTGCCA ATAAAACATC TGAAGTAAAT GAAATTTCAG AAATCTTAGA TGAAATGAGT
AAGCTTGCTA AAGAAAACAA TTCAATTACA GGCACAAGTG ATGACTTCGA AACAAGATCT
AGGATAATTC AAAAGATCAA ATAA
 
Protein sequence
MKNKLVKSLS VALAGSMLVT NVVFADGTVK KNETLYVTQE ENEIKDKTAS IWINSDGNVK 
VKDKSNLKDI KNLKTDEKID PKDGYINWNE DSKDVYYQGQ AEEDLPVDIS VKYFLDGKET
KVKDLEGKSG HLKIEISAKN KRSGIATVNG KEQKVFSPYL VLAEINFDED KVTNIKTDDG
KVVKDGKNEV VAGVLAPGLR QNFKEILEDD KLDKFKDKIE MEMDVKDYKP TEVYAVITNE
IFQDSANISS LDDLNKGIDE LESNAAKLVD GSNQLADGGN RLNDGIGQLS NGASELAAGS
GKIVSSFDQM ANAFADLPNQ VGTMTSAIDM LNNGGSSLDQ GINQYTAGVN KINENMPKLT
EASRQLEAGA SDLDNGLLKL DNATSTLKEK MGMTDGKEDL GKFSESMNEL RTGLDSLSTG
IAPLSQGVSE LSQGLSKLED SSRELSGGVN NLLSSTQNMP SLDQNSADLT AQAQAIDAVI
ANLEENNEDG NLTGQIESLR AVRDNLYKES ESVATAGYAY SQISGSLELL SQGASELSGG
ISGAYKGASD IDEKLKASSD KMITASKSLA AGTEKISQGF DKSNLAKLQE SIVMLDEATG
KLKQGSAKLK EGTSQNRAGV ENLAAAVSEL DGNSQSLREG SAELSGGLAQ FAERSKALSS
LGNINEEAIN PMAAGLNQLN DGIMKLDSST KELKDGSDQY MASFEEFRSG LSEYKAKGID
EIANKTSEVN EISEILDEMS KLAKENNSIT GTSDDFETRS RIIQKIK
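
As a rough illustration of how the gene sequence relates to the protein sequence, the following sketch translates space-grouped nucleotide blocks and computes a GC percentage. It is not the database's own method. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is the standard NCBI ordering, which assigns the same amino acids to codons as translation table 11 (the two differ only in which start codons are allowed).

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# These assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases for display."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Assumption: the sequence begins with ATG, as it does here, so the
    alternative start codons of table 11 need no special handling.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = "ATGAAAAATA AACTTGTAAA AAGCTTATCT GTAGCGCTCG CAGGATCCAT GCTTGTGACA"
last_line = "AGGATAATTC AAAAGATCAA ATAA"

print(translate(first_line))  # MKNKLVKSLSVALAGSMLVT
print(translate(last_line))   # RIIQKIK
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow a comparison with the listed protein sequence and GC content.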