Gene Apre_1682 details


Gene Information

Locus tag: Apre_1682
Symbol:
ID: 8398494
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1829949
End bp: 1831280
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 37%
IMG OID: 644996045
Product: protein of unknown function DUF21
Protein accession: YP_003153423
Protein GI: 257067167
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
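
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record does not state its coordinate convention) and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 1829949
end_bp = 1831280

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be a stop codon
# and therefore does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1332
print(protein_length)  # 443
```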


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000471578
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAACTG GACCCTACCA TAGCTTGATC ACAACGCTTA GTATAATATT GTTGCTTTTT 
ATAAATGGAG TATTTACAGC AATTCATACA GGACTTATTA GCCTTAACAC ATCAAAGTTA
GAAGAGGATT CTCTAAATGG AGATGAGAAG GCAGGATCTG TCCTTAAGAT TTTATCAAAT
CAGGATAGGC TCAATCAATC CTTTGAAATA GCAAATATAA TATTTTCTCT TTTAACCATA
GCTTACTTCG CAAAAAGGCT TAGAGAAGTG AGCTATGATG GGTATTTTTG GGGAGGAATC
TTATCAGAAA GATGGGTGAC TATAGTAGCT ATAGTGGTGT ATACCATTCT TAAGCTGATA
TTTGTAGACA AAATTCCCCA AAGAATTGGT GTAAGATTTC CAATGGAGAT TACCAAAATG
GCTACGGGAA TCACTAAGCT TATCATGGTT CTTACAAGAC CTATTGTGGC CTTTACTACA
GCTGTGACTA ATTTATTTAT GAATATATTT GGCATAGAGG CTAAAAATAT ACAAAAGCAA
GTTACAGGCG AGCAGATTAA ATCAATTGTC CAGATAGGTG AAGATCAGGG AATCCTAAGA
CCCATGGAGT CTAAGATGAT TCATTCCATA ATGGCTTTCG ACGACCTACT TGCAGAAGAG
ATTATGACAG CAAGGACTGA TGTCTTTATG ATCGATATAA ATGATAAGGA CAAGGAGTAT
CTCGAAGAGT TTGTCAAGAT AAAGCATGCG AGAATTCCAG TCTATGACGG GGAAGTAGAT
AATATCTTGG GCATTGTTTA TACCAAAGAC TTCCTCCTTG AGGCGACAAA AGTAGGGCTT
AAAAATGTAG AGATCAAGGA AATCATAAGG CCTGCCTACT TTGCACCAGA TAAGATAGAA
ACCGACAAGC TTTTTTCCGA TATGCAGAAA AAACATATCC ATATGGCGAT TTTGATAGAC
GAATACGGTG GATTTTCTGG GGTTGTCACA ATGGAAGATT TGATAGAAGA AATTGTAGGG
GATATAGATG ATTCCTACGA CTATGATATT CCAGAAATCA AGGAAAATGG CAGAGATGTC
TTCGTAGTCA AGGCTTCTGT AGGTATCAAG GACCTAAATG AGAAGATAAA TATAGGTATC
GATGAAGATA ACGAAAACTA CGATTCCTTG GGAGGTTTTA TTATAGACAG GCTTGGCTAT
ATACCGGAAG AAGACTCCAA GCTGTCCTTT GATTACAATG GTTACGAGAT CAAAATCCTC
TATATAGAAG ATAATAGAAT CAAGGCTGTT AGGATTAGAA AATTAAAAAA TAAAGAAGAT
AAAGAAGACT AG
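
The GC content field of the record can be recomputed from the gene sequence. The sketch below is one simple way to do so in Python; the function name `gc_content` is hypothetical and not part of any database tool. It strips the spaces and line breaks used for display grouping before counting.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks that group the sequence for display.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Demonstration on the first line of the gene sequence only; the value for
# this fragment will differ from the whole-gene figure in the record.
first_line = "ATGGAAACTG GACCCTACCA TAGCTTGATC ACAACGCTTA GTATAATATT GTTGCTTTTT"
print(round(gc_content(first_line), 1))
```

Passing the full gene sequence (all lines joined) gives a figure that can be compared against the rounded GC content reported in the record.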
 
Protein sequence
METGPYHSLI TTLSIILLLF INGVFTAIHT GLISLNTSKL EEDSLNGDEK AGSVLKILSN 
QDRLNQSFEI ANIIFSLLTI AYFAKRLREV SYDGYFWGGI LSERWVTIVA IVVYTILKLI
FVDKIPQRIG VRFPMEITKM ATGITKLIMV LTRPIVAFTT AVTNLFMNIF GIEAKNIQKQ
VTGEQIKSIV QIGEDQGILR PMESKMIHSI MAFDDLLAEE IMTARTDVFM IDINDKDKEY
LEEFVKIKHA RIPVYDGEVD NILGIVYTKD FLLEATKVGL KNVEIKEIIR PAYFAPDKIE
TDKLFSDMQK KHIHMAILID EYGGFSGVVT MEDLIEEIVG DIDDSYDYDI PEIKENGRDV
FVVKASVGIK DLNEKINIGI DEDNENYDSL GGFIIDRLGY IPEEDSKLSF DYNGYEIKIL
YIEDNRIKAV RIRKLKNKED KED