Gene Apre_0411 details


Gene Information

Locus tag: Apre_0411
Symbol:
ID: 8397185
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 466894
End bp: 468102
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 36%
IMG OID: 644994769
Product: protein of unknown function UPF0118
Protein accession: YP_003152181
Protein GI: 257065925
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
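
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 466894
END_BP = 468102
GENE_LENGTH_BP = 1209
PROTEIN_LENGTH_AA = 402


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for the values listed in the table.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```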


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAAAC TAGATAAAAA ATCTAGAGAT TTATTAAAGG TTATTTGTTA TGGTATTATA 
TTATTTTTTG CCTTCTGGTA TTTTCCAGTT ATAAAAGATG GCCTCGCTAG GGTAGTTGGA
GTCTTCCAGC CCTTTATCAT AGGAGGGATG ATTGCTTATC TAGTGAGTAT TCCTATGAAT
TATTTTGAAA GAAAACTCAG AGCAAACTTC CCAGATAAGA AGTATAGGAA AAGGATAAGT
GCATTATCTT TATTTGTATC ATGGGTACTT ATTATATGTT GCTTGATTTT ATTTTTAAAC
ATCCTGATTC CAAGGATTGT TGCAGTCATC TTCTCCTTCT TCAATAGGTG GCCTGAGTTT
ATTAGAGAAA CTTACGAGAC TTTGAACAGT CACGCTATAA CAAGGCCTTA TGCTGATAAG
TTCTACGAAT ATGTGAATTC ATTCGGCTGG TATGAGGTGA GAAATGCTGT AATGAATTTT
ATAACAGACA AGAAGACTAA TCTTTTTAGT CTAACTACAG GAGTCCTTAA CTCAGTAAGC
TCTTCTTTGA TTACGATCTT TACGATAATT GTCTTTTCGA TTTTTGTCCT AATCTATAAG
GATATGCTAA AGACAAATGG AACAAGGATT ATCTATGCTT TGATGAGTGA AAAGAAGGCG
GATTATATAA ACAAGGTCCT ATCCCTATCT TATAACACCT TCAAGGATTA TATTTTCTCA
AGGCTTATAG CTGTGGTTAC CCTATCAGCC TTAACCTTTG TGGGCATGTT TATTATGGGC
ATCCCCAACG CTGGGGTCAT CTCGCTTTTT GTGGGAGTGT CAGATTTAAT TCCAATCTTT
GGTCCCATAG TTGGTGCAGG TCTATCGGCA GTCATTATAT TTTTGGAAAG TCCAGTCAAG
GCTTTAATTT TCCTAATCTA TGATGTAATA ATCCAGCAAA TCCAAGAAAA TATTATCTAT
CCTGCCATAG CTGGAGAGAA GATTGGCCTT CCTGCAGTAT GGGTCCTTGC AGCAATTACA
ATAGGTGGGT CGCTCTTTGG CATATGGGGT ATGCTTATCG GTATTCCTGT AGCTTCTGTA
ATATATACGC TCTTTCATGA GTTTATTGAT AATAAGCTTA AGGCTAAGGA AATAACAGAT
AAGATGATCG AAGAAAAGAA GAATGAGAAA TACACCATGG AGGATATAGA TATTCATGAA
GCTCAGTAG
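
The gene sequence is printed in blocks of ten bases, so it has to be joined before any length or GC calculation. The following is a minimal sketch of how that could be done; `clean_sequence` and `gc_content` are hypothetical helper names, and only the first line of the sequence is used as sample input here.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, as printed in the record.
first_line = "ATGGAAAAAC TAGATAAAAA ATCTAGAGAT TTATTAAAGG TTATTTGTTA TGGTATTATA"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_content(seq):.1%}")
```

Passing the full 1209 bp sequence to the same functions would give a value to compare against the GC content reported in the Gene Information section.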
 
Protein sequence
MEKLDKKSRD LLKVICYGII LFFAFWYFPV IKDGLARVVG VFQPFIIGGM IAYLVSIPMN 
YFERKLRANF PDKKYRKRIS ALSLFVSWVL IICCLILFLN ILIPRIVAVI FSFFNRWPEF
IRETYETLNS HAITRPYADK FYEYVNSFGW YEVRNAVMNF ITDKKTNLFS LTTGVLNSVS
SSLITIFTII VFSIFVLIYK DMLKTNGTRI IYALMSEKKA DYINKVLSLS YNTFKDYIFS
RLIAVVTLSA LTFVGMFIMG IPNAGVISLF VGVSDLIPIF GPIVGAGLSA VIIFLESPVK
ALIFLIYDVI IQQIQENIIY PAIAGEKIGL PAVWVLAAIT IGGSLFGIWG MLIGIPVASV
IYTLFHEFID NKLKAKEITD KMIEEKKNEK YTMEDIDIHE AQ
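
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python. It builds the standard codon table, which translation table 11 shares for all amino-acid assignments; table 11 differs in the start codons it permits, which does not matter here because the gene starts with ATG. The `translate` function is an illustrative helper, not a library call.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base
# varies fastest). Translation table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGGAAAAAC TAGATAAAAA ATCTAGAGAT"))  # MEKLDKKSRD
# The final 9 bases end in the stop codon TAG.
print(translate("GCTCAGTAG"))  # AQ
```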