Gene Apre_1100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1100 
Symbol 
ID8397887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1176008 
End bp1177360 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content39% 
IMG OID644995447 
Productcitrate synthase 
Protein accessionYP_003152848 
Protein GI257066592 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAAGA AACAAGAAGA AAAACTAAAG TTCTATGCCA AAGAGATTGA AAGAAATAAT 
AAAATCAAAA AAGAAGTCTA TGAAAATTAT AATATAAAAA GAGGTCTTCG AAATAAAAAT
GGGACTGGAG TCCTTGTCGG AGTAACCCAG GTGGGTGATG TATCAGGCTA TAAAATCATA
GATGGAAAGA AAATCCCTAG CCGAGGTGAA CTCTACTATA GGGGCTACCC ACTTACAGAA
ATTGTTGAAG ACATAGAAAG GGATAAAAGA TTAGGTTTTG AGGAGATTAT TTACCTATTA
CTCTTTAGCA AACTAGCCAA TGAAGATGAG CTTAAATCCT TTAAGAGTAT CCTAGTCGAA
GAAAGAGCTC TGGCGGATGG TTTTTTTGAG GACATAATCC TAAAAGTTCC AGGGTCTGAT
ATAATGAATA AGATGATGAG GTCTATGCTC GCCCTTTATA CCTACGACAA AAATCCAGAT
GGGACAGATG CCCTAAATGT TCTAAGTCAA TCCTTATCTT TAATTTCCAA GATACCAATC
CTTGCAGTCT ACTCCTACCA GGTCAAAATC CACAATTTTG ATAAGAAGTC TCTGATAATC
CACAATCCAG ATGATAGGCT GACAATCGCA GAAAACATCC TTCAAATGCT GAGAAATGAC
CAAGCCTACG AGAAAGTTGA GGCTGAAATC CTAGACCTAA TGCTAATAAT TCACGCAGAA
CACGGGGGAG GAAATAACTC TGCCTTTGCA ACCCATGTAG TCTCTTCATC GGGGACAGAT
ACATATTCTG CTATAGCAGC AGGTCTTGCA TCCCTTAAGG GTCCCAAGCA TGGTGGAGCG
AATCTCAAAG TAAGTAAGAT GCTTAAAGAT ATTAGAGAAA ACGTGGATGA TTTAGATGAT
AGATCTAAGA TCAAAGCCTA TCTAGAAAAG ATTTTGGATA AAAAAGCCTT TGATAAGAAG
GGTTTAATAT ACGGTCTAGG TCATGCCGTC TACACCTTAT CAGATCCTAG GGCAATTCTT
CTTAAGAAAA AGGCCAGGGA ACTTTCTATA ATCAAGGGAA GGGAAGAGGA TTTTCACTTT
ATAGAAAATG TAGAAAAAAT AGGAAAAGAC TTGATAAGTC AAAGGCAAAA CAGGCCATAT
CCACCTTGCG CCAATGTAGA CCTTTACTCG GGCTTTGTCT ACGATATGCT AAAAATTCCT
GAAGAGCTCT ACCTTCCTAT GTTTGCTATA GCAAGAACAG TTGGCTGGTC TGCCCATAGG
TTAGAGCAAA TTCAAGATGA GAAGATAATC AGACCTGCTT ACAAGTCTCT AAATGACAGG
AGGGACTATC TTCCCCTAAG AGAAAGAAAG TAA
 
Protein sequence
MDKKQEEKLK FYAKEIERNN KIKKEVYENY NIKRGLRNKN GTGVLVGVTQ VGDVSGYKII 
DGKKIPSRGE LYYRGYPLTE IVEDIERDKR LGFEEIIYLL LFSKLANEDE LKSFKSILVE
ERALADGFFE DIILKVPGSD IMNKMMRSML ALYTYDKNPD GTDALNVLSQ SLSLISKIPI
LAVYSYQVKI HNFDKKSLII HNPDDRLTIA ENILQMLRND QAYEKVEAEI LDLMLIIHAE
HGGGNNSAFA THVVSSSGTD TYSAIAAGLA SLKGPKHGGA NLKVSKMLKD IRENVDDLDD
RSKIKAYLEK ILDKKAFDKK GLIYGLGHAV YTLSDPRAIL LKKKARELSI IKGREEDFHF
IENVEKIGKD LISQRQNRPY PPCANVDLYS GFVYDMLKIP EELYLPMFAI ARTVGWSAHR
LEQIQDEKII RPAYKSLNDR RDYLPLRERK