Gene Apre_1278 details

Gene Information

Locus tag: Apre_1278
Symbol:
ID: 8398067
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1375597
End bp: 1376766
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 42%
IMG OID: 644995622
Product: ApbE family lipoprotein
Protein accession: YP_003153022
Protein GI: 257066766
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis
TIGRFAM ID:
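
The coordinate and length fields in this record can be cross-checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1375597, 1376766)
print(length)                  # 1170
print(protein_length(length))  # 389
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.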


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAATAA AGAAAGTATT TTACCCCTTG GCAATGGCCC TTGTAATAAC AGCCTGTGCC 
AATGGAGCAA ATGAGACAGA TAATAATAAG GACCAGGTAA GCCAAGCGGA AGAAAAAGTA
GAGAAAAAAG AGGCAGAGGA AGACACAAAA GCAGAAGATG TAGGAGAAAA TCAAGAAGAT
AAGAAAAGCA CTGAAGTGCC AAAACTTGAC AAAACCTACT ACGACTATTT CGACACTGTA
ACTACCCTTC TAACCTACTC TGATGATGAA GAAAGCTTCA AGAAACAATG CGACGTCCTC
GAAGAAGAGC TAGCTAGATA TCACAAGCTC TACAACTCCT ACGATTCCTT CGAAGGAGTA
AATAACTTCA GAACAATCAA CGAAAAGGCA GGAATCGAGC CAGTCAAGGT AGACCCTGAA
ATAATCGAGC TAATCGAATA CTCAAAGAAA ATGTACGAAC TAACAGACGG AAACATCAAC
ATAGCCATGG GATCTCTCCT AGGCTTGTGG CACCAATACA GGGAAATGTC CATAGATAAT
CCTGAAAAGG CAGCAATCCC ACCAGAAGAT GAGCTCATCA AGAAAAGCGA GCACGAAAAC
ATAGATGCCA TTGAAATAGA CAAGGAAAAC TCCACAGTCT ACATCAACGA CCCAGACGTC
CAAATAGATA TAGGAGCAAT CGGCAAAGGC TACGCCACAG AAAAAATGGC AGAAAAACTA
AAAGAAGCAG GATTTGAAAG AGGAATCCTC TCAGTCGGTG GAGATGACGT AATCATAGGA
GAAAATCCAA ACAACAGCCA AGGAAACTGG AAAATAGCAG TCCAAAATCC CTTCCTAGAA
GATAAAGAAA ATCCATACTC CACAGTAGTA AACGTCAAGA ACACCTCAGT AGTAACAAGC
GGTGACTACC AAAGATTCTT CACAGTAGAC GGCAAAAACT ACCACCACAT CATAGACCCA
GCCACCAGAT ACCCATCCGA CAAATGGAAA TCCGTATCAG TAAAAGCAGA CAGCATAGCC
CTAGCAGACA CCCTCTCAAC CTACTTCTTC ATAGTAGACC ACGAGACAGG ACTAAAAAAA
GCAGCTGAAA ACAAAGTAGA AGCATACTGG ATAGACCAAG AAGGAAACGA ATACAAAACC
GAAGGCTGGG AAAAAATAGA AGATAAATAA
 
Protein sequence
MRIKKVFYPL AMALVITACA NGANETDNNK DQVSQAEEKV EKKEAEEDTK AEDVGENQED 
KKSTEVPKLD KTYYDYFDTV TTLLTYSDDE ESFKKQCDVL EEELARYHKL YNSYDSFEGV
NNFRTINEKA GIEPVKVDPE IIELIEYSKK MYELTDGNIN IAMGSLLGLW HQYREMSIDN
PEKAAIPPED ELIKKSEHEN IDAIEIDKEN STVYINDPDV QIDIGAIGKG YATEKMAEKL
KEAGFERGIL SVGGDDVIIG ENPNNSQGNW KIAVQNPFLE DKENPYSTVV NVKNTSVVTS
GDYQRFFTVD GKNYHHIIDP ATRYPSDKWK SVSVKADSIA LADTLSTYFF IVDHETGLKK
AAENKVEAYW IDQEGNEYKT EGWEKIEDK
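
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes GC content, and translates a coding sequence up to the first stop codon. It uses the standard codon assignments, which translation table 11 shares with table 1 for elongation; the alternative start codons that table 11 allows are not handled, which does not matter for a gene beginning with ATG such as this one. All names (`clean`, `gc_content`, `translate`) are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for the standard code, listed in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAGAATAA AGAAAGTATT TTACCCCTTG GCAATGGCCC TTGTAATAAC AGCCTGTGCC"
print(translate(first_line))  # MRIKKVFYPLAMALVITACA
```

The printed residues match the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.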