Gene Apre_1256 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1256 
Symbol 
ID8398045 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1348646 
End bp1350121 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content40% 
IMG OID644995601 
Producttranscriptional regulator, XRE family 
Protein accessionYP_003153001 
Protein GI257066745 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1004] Predicted UDP-glucose 6-dehydrogenase 
TIGRFAM ID[TIGR03026] nucleotide sugar dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAT TATCTATAGA GAAATTGGCA GATACTGTCA TTAACAAGAG AAAAGAAAAA 
GGAATAACTC AAAAAAGTCT TGCTGATACT ACTGGTATCA ACAGGGCTAT GATCAGTCGT
TTGGAATCAT GCGACTACAC ACCTTCTATC GACCAGCTAC AAGCTATAGG AGAAGTCTTG
GACTTTGAAG TAGTTGATAT GTTTGAGGAA GAAACTTACG AAAAAGAAAT TAAATCAGAC
AAAAAATACA AGATAGCTGT TGCAGGAACT GGATATGTAG GTATGTCCAT TGCGACCCTC
CTATCCCAAC ATAATGAGGT TACTGCAGTT GATATTGTAG AAGAAAAGGT AGAGAAGATT
AATAATAAGA TCTCTCCTAT CCAGGATGAT TATATAGAAA AATATCTCGA AGAAAAGGAC
CTAAATTTAA GAGCAACCAT AGATGGAGAG GCTGCCTACA AGGATGCTGA CTTTGTAGTA
ATCGCAGCTC CTACCAACTA CGATAGCAAG AAGAACTTTT TCGATTGCTC TGCTGTAGAA
GATGTAATCG AGCTTGTCCT TAAGGTCAAT CCAGAAGCTA CTATGATTAT CAAATCCACT
ATCCCAGTTG GTTACACTAG AGAAATTAGG GAAAGATATG AGACAGATAA GATTATCTTT
AGCCCAGAAT TCCTTCGTGA ATCCAAGGCT CTTTACGACA ATCTCTACCC TTCAAGAATC
ATTGTATCAT GCGATGATCA AAGTAGGGAT AAGGCAGAAA TATTTGCAAA TCTCCTTAAA
GAAGGCGCCA TCAAAAAGGA CATCCCTACC CTCTTTATGG GTTTTACAGA GGCAGAAGCA
GTCAAGCTTT TCGCAAACAC CTACCTCGCC CTTCGTGTAT CCTACTTCAA CGAACTTGAT
ACCTACGCAG AAAGCAAGGG ACTAAATACA GAAGAGATCA TCAACGGAGT ATGCCTAGAT
CCAAGAATAG GCACCCACTA CAACAACCCT TCCTTTGGCT ATGGTGGATA CTGCCTGCCA
AAAGATACCA AACAACTTCT AGCAAACTTC GACAAGGTCC CACAAAACAT GATCTCCGCA
ATCGTAGACT CCAATAGGAC CAGAAAGGAC TTCATAGCAG ATCAAGTCCT AAACATAGCA
GGCTACTACG ATTACAATTC AGACGACCAG TATCAACCAG AAATGGAAAA AGACTGTGTA
ATAGGAGTCT ACAGACTCAC CATGAAGTCA AACTCAGACA ACTTCCGCCA ATCCTCTATC
CAAGGAGTTA TGAAAAGAAT CAAGGCCAAG GGAGCAAAGG TAATAATCTA CGAACCAACC
CTAGAAGACG GTGACACCTT CTTTGGATCT TTAGTAGTAA ACAACCTAAA CAAATTCAAA
AAAATGAGCC AGGCAATAAT AGCCAACAGG TACGACGAGA GCCTAGACGA TGTGATGGAG
AAGGTATACA CGAGGGATAT ATTTAAGAGA GACTAG
 
Protein sequence
MKKLSIEKLA DTVINKRKEK GITQKSLADT TGINRAMISR LESCDYTPSI DQLQAIGEVL 
DFEVVDMFEE ETYEKEIKSD KKYKIAVAGT GYVGMSIATL LSQHNEVTAV DIVEEKVEKI
NNKISPIQDD YIEKYLEEKD LNLRATIDGE AAYKDADFVV IAAPTNYDSK KNFFDCSAVE
DVIELVLKVN PEATMIIKST IPVGYTREIR ERYETDKIIF SPEFLRESKA LYDNLYPSRI
IVSCDDQSRD KAEIFANLLK EGAIKKDIPT LFMGFTEAEA VKLFANTYLA LRVSYFNELD
TYAESKGLNT EEIINGVCLD PRIGTHYNNP SFGYGGYCLP KDTKQLLANF DKVPQNMISA
IVDSNRTRKD FIADQVLNIA GYYDYNSDDQ YQPEMEKDCV IGVYRLTMKS NSDNFRQSSI
QGVMKRIKAK GAKVIIYEPT LEDGDTFFGS LVVNNLNKFK KMSQAIIANR YDESLDDVME
KVYTRDIFKR D