Gene Apre_1035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1035 
Symbol 
ID8397822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1104654 
End bp1105838 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content31% 
IMG OID644995383 
Productprotein of unknown function DUF795 
Protein accessionYP_003152784 
Protein GI257066528 
COG category[R] General function prediction only 
COG ID[COG1323] Predicted nucleotidyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAC TTGCAATTAT TTCTGAATTT AATCCATTTC ACAATGGACA CAAATATTTA 
ATAAACAAGG CAAAGGAAAT TACAAAAACA GACTTAGCTA TTAGCTTAAT GAGCGGTGAT
TTCGTTCAAA GAGGTGAAGC GAGTCTTATA GATAAGTATT CTAGAGCTGA CGCTGCCTTA
GATAATGGCT TCGACCTTGT TATAGAGATG CCTAACTTTA TATCTCTGCA ATCAGCGGAG
TTTTTCTCCT ACAAATCCAT CGAACTTTTA AACAAATTAA AGATAGACTA TCTTGCTTTT
GGAATAGAGA ATTTAGATAG TGAAGAATTT CTTGATATTT CAGCTAGGTT AATAAAAGAT
AATGATAGAT TGGAAGAATT AACTAAATAT TATATTGATA AAAAATATTC ATTTACTGAA
GCAAAGTACC TTGCTCTCAA AGACTTCCTA GGAAGAGAGG ATTTTATAAG TTCTAACAAT
ATCCTTGCCC TCGAGTATAT GATATCAATT AGTAAAATCA ACCCAAATAT TATGGCAATT
CCTATTAGAA GGCTTGGAGC AAATAACCAA GACCTAGATA TAAAAGATGA AAAGTATGCC
TCATCTACAT CAATAAGAAG GAATCTTTCT GGAAATATAG AAAAACTTAT GCCTTCCTCT
TCCTATCAAA AATTAAAATC TTTTCAAAAA AATTATGGGC TAGCCAATAA GGAGAATCTT
TTTGAGATTT TCAAATATAA ATTTATGATT GAAGAAAGTC AAATGCAAGA TTCCTTGTGC
TATGAGGAGG GTCTAGATAA TTATTTCAAG ACCTTGTTAA AAGATTCGCC CACCTACGAT
GAATTTATTG AACTTGCTGT ATCAAAGCGT AATACAATGG CGAGGATTAA GAGATTAATG
TTAAACTATA TACTAAATAA TAAAAAATCT CTTAATGATC TTGATTATAA TTTTGTTAAA
GTTCTTGCTT TTAATGAGAA AGCTACAAAA CTTTTTAGAG ATATTAAAAA AGAATTGAAA
ATTGTTATAA GAAAGTCTGA TATAGAAGCA TTAGACCACG ACGATCTTCT TGTGTACGAA
AACATGCTAA GGGCAAGCAA CCTCTACTCA CTCCTAATAG ATAGACAGTT TAATACAGAC
TTCACTAGAA AAATTTCTAT TAAAAAAACC TATGAGGCCA ATTAG
 
Protein sequence
MKKLAIISEF NPFHNGHKYL INKAKEITKT DLAISLMSGD FVQRGEASLI DKYSRADAAL 
DNGFDLVIEM PNFISLQSAE FFSYKSIELL NKLKIDYLAF GIENLDSEEF LDISARLIKD
NDRLEELTKY YIDKKYSFTE AKYLALKDFL GREDFISSNN ILALEYMISI SKINPNIMAI
PIRRLGANNQ DLDIKDEKYA SSTSIRRNLS GNIEKLMPSS SYQKLKSFQK NYGLANKENL
FEIFKYKFMI EESQMQDSLC YEEGLDNYFK TLLKDSPTYD EFIELAVSKR NTMARIKRLM
LNYILNNKKS LNDLDYNFVK VLAFNEKATK LFRDIKKELK IVIRKSDIEA LDHDDLLVYE
NMLRASNLYS LLIDRQFNTD FTRKISIKKT YEAN