Gene Apre_1231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1231 
Symbol 
ID8398020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1314458 
End bp1315726 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content39% 
IMG OID644995576 
Productprotein of unknown function DUF1063 
Protein accessionYP_003152976 
Protein GI257066720 
COG category[S] Function unknown 
COG ID[COG3681] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000111726 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGACT TTAAAGAGTT AATCAAAGAA GAGCTAATAC CTGCCACAGG TTGCACTGAA 
CCAATTGCTA TAGCCTACGC ATCAGCTAAG GCAAGAGAGG TTCTAGGATC TGATCCAGAA
AAAATTATCG CAAATCTTTC TAGCAATATC ATTAAAAACG CCAACTCTGT AACCGTCCCT
TCCACCATGG GTAGGAAGGG AATAGAGATA TCAGTAGTTG CTGGTATATA TTTAGGGGAC
CCTAATAGGG AACTTGAAGT CCTAGCAGAT GTTGATAAGA GTAAGCTTGA TTTCTGTGAT
AAGATTATAG AAGAAGGGCT GGTAAGGGTA AACCTCGCTA GAGAACACGA GGGGCTTTTT
ATCCAAATCA TCCTTGAAAA TAAAAAGTCT ACAGCAAGCG TCACTATAGC TGATAGCCAT
ACCAATATTA TTGAAATAAA AAAGGACGGC AAGCTAATAT ATCAAAAGGA AAAGGAAGAA
GCAGTAAAAG AGGATATTGA CTTTTCTTTC GATAAGGTTT ATGATTTTGC TAGAACTTGT
GATTATTCTG ATATCAAGGA AATTCTCGAT AGGCAAATTT CTTTTAATGA AAAAATCGCA
GAAGAAGGAA TCAAAAACGA CTGGGGAGCT AATATAGGAA AACTCATCCT AAATAATGAC
CCATCAAACT ACTATGAGAA GCTCGCTGCC TTTGCTGCAG CAGGATCTGA TGCTAGGATG
AACGGCTGTG AGCTGCCTGT AATCATTAAC TCAGGATCAG GAAATCAAGG AATTACTACC
TCAGTCCCTG TAATCTTATA TGCCAGAGAC AATGATTTCT CAGAAGATGA GCTCTACAGG
GCCCTTATAT TTTCTAATCT AATTGCTTTG TATATCAAAA ACAAAATAGG CAAGCTTTCT
GCCTACTGTG GAGTAGTATC TGCCTCTGCT GCAGCTATCG CTTCCATAGC TTTTATAAAC
AAAGAAGATA AGAAGATTGT AGAAGATACG ATTACTAACG CCCTAGCCGT AAACTCCGGA
ATAATATGTG ACGGGGCCAA GTCCTCTTGT GCTATGAAGA TCGCTTCAAG CCTTAGAAAT
GCGAGCCTTG CCTATATGCA GGCCAAGACA GACAATTCCT TTGAAGTAGG AGATGGCATA
GTCAAAGAAA ATATAGACAA AACGATCGAT ACAGTTGCAA GAATTGCAAA ATACGGAATG
AAAAAGACTG ACGAGGTCGT CTTAAGCGAG ATGATAGGCA AGGATGACTA TCTCGAAGAC
TTTGAATAA
 
Protein sequence
MTDFKELIKE ELIPATGCTE PIAIAYASAK AREVLGSDPE KIIANLSSNI IKNANSVTVP 
STMGRKGIEI SVVAGIYLGD PNRELEVLAD VDKSKLDFCD KIIEEGLVRV NLAREHEGLF
IQIILENKKS TASVTIADSH TNIIEIKKDG KLIYQKEKEE AVKEDIDFSF DKVYDFARTC
DYSDIKEILD RQISFNEKIA EEGIKNDWGA NIGKLILNND PSNYYEKLAA FAAAGSDARM
NGCELPVIIN SGSGNQGITT SVPVILYARD NDFSEDELYR ALIFSNLIAL YIKNKIGKLS
AYCGVVSASA AAIASIAFIN KEDKKIVEDT ITNALAVNSG IICDGAKSSC AMKIASSLRN
ASLAYMQAKT DNSFEVGDGI VKENIDKTID TVARIAKYGM KKTDEVVLSE MIGKDDYLED
FE