Gene Emin_0996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0996 
Symbol 
ID6262798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1086451 
End bp1088076 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content39% 
IMG OID642611476 
Productphosphoribosylglycinamide synthetase 
Protein accessionYP_001875886 
Protein GI187251404 
COG category[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.000037072 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATAAAG GCAAAGTACT TATAGTTGAC GGATACTCCA CAGGTAAATA TTATGCTGAC 
CGCCTAAAAG AAAGAGGCAT TACTCCTTTT CACCTTACTT CGGGGATGGA AAAAAATACT
TCACTTCCGC AAGATGTTAT TGAAAAATAC ATTGCCTCGC AAATAGGGAC AAGTTATCAA
ACAACTTATC TCATGCCTGA TTCACTCCAG TCCTTGTTGG AAGAATTTGG CAAACATAAT
TTCGCCGCTG TTATACCCGG TACGGAAAGC GGCGTGGAAG TAGCGGAACG TTTGTCCGAA
TACTTTAAAC TGCCTTCCAA TGATTTTAAG ACAGTTGCCC TTAGAAGGGA TAAGCACCTT
ATGCAGCAGG CTCTTAAAAA CGCGGGCTTA AAATATATCT CTTTTTTAAA AACAGCCAAG
GTTGAAGAGG CTCTTTCCTG GATTGAAAAA AATAATTTTA AAAAAATTGT TATAAAACCG
CTTATGAGCG CGGGAACTGA CGGTGTTAAA GTTTGTGAAG ATAAGGACAG CGTAAAAAAG
GCTTTTGAGT CTTTGATAGG CACAAAAGAC GGCTTTGGCA GAAAAAATGA CGAGGTTTTG
GTTGAGCAGT TTATAGAAGG CAAAGAAATT GTTGTTAACT GCGTTTCGCG CGGGGGGGAG
CATATTTTAA CTGATGTAAT GATTTACAAC AAAATATTAA CCGTTGATAA AAACCCTGTA
TACGACGCTT CTCTTTTAAT AAAAAACCTT ACTCCGGAAT TTAAAGAATG CGTAGATTAT
ACTTTTAAAG TTCTTAATGT TTTGGGTATT AAATACGGCG CGTCCCACAC GGAAATAATG
CTTACACCCG AAGGCCCGGT TCTTATAGAA ACAGGCGCCA GAGTTATGGG AAGGCTTAGC
GAAATTTATT GGGAAGCCTT GGGCCGCAAC AGCATTGATT TGATTCTTGA CAGCTATCTT
GACGGCGTAA AGCATAAAGA AAATATGCTT AAGCCTTATA ATCCCAAAAA ATCTTTTCTT
TATAAATATT TTATTTCTTA TGCGAATGCG GAAATATCTT CCCTTCCTGT TTTTGACAGT
TTAGGGGAGC TTCCCTGCGT TAGAGAGCTG ACTTTTGCGC TTGCCCGGCA AAGTATGCGC
GTTAAAAAAA CTATTGATAT GCCTACAATG CCCGGTGAAG GTGTTTTTAT AAGCGAGCAA
GAGGAAGAGA TTATAAACGC CTATAAAACC GCGCGATTTT TAGAAGTTTG CGCGCCCGGC
CTTTTGTATG AGCCTAAAGA CGCCGTGCCT ATGGCTTTTG AAAAGGAACT TTTAGCAAAA
ATTAAGGATA AAGGCTCTTT ATGTGATGAA TATGACACTC TTTTTAAAGG TTTTGAATTA
AAATATAAAG ATTTTGAAAA CAGTGTTTTA TATGTTTTAA ATGACTTAAT GCTGCCCTGC
CTTAACGGTG AAATTGGAGA TGTGTATATA GCGGGCAAAC AGCTTTATGA GGATAAACGG
ACTAAAGAAA CTCTTATCCC TAATCCGTTT TACAGCGTTT TTCTTCCCGT GGCCGCGGAG
CCCTTTTTAT TTAAAACAGG AGATAAAGGC CGCATCGAAA AAACCGGGGA AATAAAAATT
TTATAA
 
Protein sequence
MDKGKVLIVD GYSTGKYYAD RLKERGITPF HLTSGMEKNT SLPQDVIEKY IASQIGTSYQ 
TTYLMPDSLQ SLLEEFGKHN FAAVIPGTES GVEVAERLSE YFKLPSNDFK TVALRRDKHL
MQQALKNAGL KYISFLKTAK VEEALSWIEK NNFKKIVIKP LMSAGTDGVK VCEDKDSVKK
AFESLIGTKD GFGRKNDEVL VEQFIEGKEI VVNCVSRGGE HILTDVMIYN KILTVDKNPV
YDASLLIKNL TPEFKECVDY TFKVLNVLGI KYGASHTEIM LTPEGPVLIE TGARVMGRLS
EIYWEALGRN SIDLILDSYL DGVKHKENML KPYNPKKSFL YKYFISYANA EISSLPVFDS
LGELPCVREL TFALARQSMR VKKTIDMPTM PGEGVFISEQ EEEIINAYKT ARFLEVCAPG
LLYEPKDAVP MAFEKELLAK IKDKGSLCDE YDTLFKGFEL KYKDFENSVL YVLNDLMLPC
LNGEIGDVYI AGKQLYEDKR TKETLIPNPF YSVFLPVAAE PFLFKTGDKG RIEKTGEIKI
L