Gene SeD_A0401 details

Gene Information

Locus tag: SeD_A0401
Symbol: prpC
ID: 6872225
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 422287
End bp: 423456
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 56%
IMG OID: 642783633
Product: methylcitrate synthase
Protein accession: YP_002214320
Protein GI: 198243776
COG category: [C] Energy production and conversion
COG ID: [COG0372] Citrate synthase
TIGRFAM ID: [TIGR01800] 2-methylcitrate synthase/citrate synthase II
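
The coordinate and length fields in this record agree with one another, which a little arithmetic confirms. The sketch below assumes that the start and end positions are inclusive, 1-based coordinates and that the CDS ends in a stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a complete CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(422287, 423456)
print(length_bp)                  # 1170
print(protein_length(length_bp))  # 389
```

Under these assumptions, 423456 - 422287 + 1 gives the listed 1170 bp, and 1170 / 3 = 390 codons gives the listed 389 residues once the stop codon is excluded.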


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.600733
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.00000000076827
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACAGACA CGACGATCCT GCAAAACAAC ACGCATGTCA TTAAGCCTAA AAAATCGGTC 
GCGCTTTCCG GCGTACCTGC CGGAAATACC GCTCTGTGTA CCGTGGGAAA AAGCGGTAAC
GATCTGCACT ATCGCGGTTA CGACATTCTC GATCTGGCGG AGCACTGCGA ATTTGAAGAA
GTCGCGCATT TACTGATCCA CGGAAAATTA CCGACCCGCG ACGAGCTGAA CGCGTATAAA
AGCAAGTTAA AAGCGCTGCG CGGATTACCC GCCAATGTCC GTACCGTTCT GGAAGCGTTA
CCGGCGGCCT CGCACCCGAT GGACGTGATG CGTACCGGCG TTTCCGCGCT GGGCTGCACC
CTGCCGGAAA AAGAGGGACA TACCGTCTCC GGCGCGCGCG ATATTGCCGA TAAGCTGTTG
GCCTCGCTCA GCTCTATCCT TCTTTACTGG TATCACTACA GCCACAACGG CGAACGTATT
CAGCCGGAAA CCGACGATGA TTCCATCGGC GGCCATTTCC TGCATCTGCT GCACGGTGAA
AAGCCAACCC AAAGCTGGGA AAAGGCGATG CATATTTCGC TGGTGCTGTA TGCCGAGCAT
GAGTTCAACG CCTCGACGTT TACCAGCCGG GTGATTGCCG GAACCGGCTC GGATGTCTAC
TCCGCGATTA TCGGCGCGAT TGGCGCGCTG CGCGGCCCGA AACACGGCGG GGCGAATGAG
GTGTCGCTGG AAATTCAACA GCGTTATGAA ACGCCGGACG AGGCAGAGGC GGATATCCGT
AAACGCGTGG AAAACAAAGA GGTGGTGATT GGCTTTGGAC ATCCGGTTTA CACCATCGCC
GACCCGCGCC ATCAGGTGAT CAAACGGGTG GCGAAACAGC TTTCAGAAGA AGGCGGCTCG
CTGAAGATGT ACCACATCGC TGACCGTCTG GAAACGGTGA TGTGGGAGAC AAAAAAGATG
TTCCCGAATC TCGACTGGTT TTCGGCGGTC TCCTACAACA TGATGGGCGT CCCTACCGAA
ATGTTCACCC CGCTGTTTGT CATCGCCCGC GTTACCGGCT GGGCGGCGCA CATTATTGAA
CAGCGTCAGG ACAACAAAAT TATTCGCCCC TCTGCCAACT ATACCGGCCC GGACGATCGC
CAGTTTGTGC CGATCGAAAA GCGTTGCTAA
 
Protein sequence
MTDTTILQNN THVIKPKKSV ALSGVPAGNT ALCTVGKSGN DLHYRGYDIL DLAEHCEFEE 
VAHLLIHGKL PTRDELNAYK SKLKALRGLP ANVRTVLEAL PAASHPMDVM RTGVSALGCT
LPEKEGHTVS GARDIADKLL ASLSSILLYW YHYSHNGERI QPETDDDSIG GHFLHLLHGE
KPTQSWEKAM HISLVLYAEH EFNASTFTSR VIAGTGSDVY SAIIGAIGAL RGPKHGGANE
VSLEIQQRYE TPDEAEADIR KRVENKEVVI GFGHPVYTIA DPRHQVIKRV AKQLSEEGGS
LKMYHIADRL ETVMWETKKM FPNLDWFSAV SYNMMGVPTE MFTPLFVIAR VTGWAAHIIE
QRQDNKIIRP SANYTGPDDR QFVPIEKRC
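
The sequences above are printed in space-separated blocks of ten, so the spacing has to be stripped before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its codons translated. It assumes the standard codon assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons, and this sketch does not handle those). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
fragment = "ATGACAGACA CGACGATCCT GCAAAACAAC"
print(translate(fragment))  # MTDTTILQNN
```

The fragment translates to the first ten residues of the protein sequence listed above. Running `gc_fraction` and `translate` on the full gene sequence would allow a comparison against the GC content and protein length fields of the record.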