Gene SeD_A4696 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4696 
SymboldcuB 
ID6871850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4561859 
End bp4563199 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content54% 
IMG OID642787594 
Productanaerobic C4-dicarboxylate transporter 
Protein accessionYP_002218192 
Protein GI198245398 
COG category[R] General function prediction only 
COG ID[COG2704] Anaerobic C4-dicarboxylate transporter 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases
[TIGR00770] anaerobic c4-dicarboxylate membrane transporter family protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTATTTA GTATACAGCT TCTCATAATA TTAATATGTC TGTTTTATGG TGCCCGAAAG 
GGCGGGATCG CGCTCGGGTT GTTGGGTGGT ATCGGTCTGG TCATTCTGGT GTTTGTTTTC
CATCTCCAGC CAAGCAAACC GCCCGTTGAC GTAATGCTGG TCATTATCGC GGTAGTCGCC
GCGTCGGCGA CGTTGCAGGC GTCAGGCGGG CTGGATGTGA TGCTGCAGAT TGCCGAAAAG
CTGCTGCGTC GCAACCCCAA ATACGTCTCT ATTGTGGCGC CGTTCGTCAC CTGTACCCTG
ACGATTCTGT GTGGGACAGG CCACGTGGTC TACACCATTT TGCCGATTAT CTATGACGTG
GCGATCAAGA ATAATATCCG TCCGGAACGT CCAATGGCGG CCAGTTCTAT CGGCGCGCAA
ATGGGCATCA TCGCCAGTCC GGTTTCCGTC GCCGTGGTTT CTCTGGTAGC GATGCTGGGC
AACGTGACAT TTGACGGAAA ACATCTGGAG TTCCTCGATC TGCTGTCGAT CACCATCCCG
TCTACCCTGC TCGGTATCCT GGCAATCGGT ATTTTTAGTT GGTTCCGCGG TAAAGATCTG
GATAAAGACG AAGCGTTTCA GAAATTTATT TCCGTACCGG AAAACCGTCA GTACGTGTAC
GGCGATACCG CGACGCTGCT GGATAAAAAA CTGCCGAAAA GCAACTGGCT GGCGATGTGG
ATCTTCCTGG CGGCGATTGC CGTGGTCGCT CTCCTGGGCG CGGACTCCGA CTTACGTCCA
ACCTTCGGCG GCAAACCGTT GTCGATGGTG CTGGTCATTC AGATGTTTAT GCTGCTGACC
GGGGCGCTCA TTATCATCCT GACCAAAACC AATCCTGCGT CTATCTCAAA AAACGAAGTT
TTTCGTTCCG GTATGATTGC GATTGTCGCG GTATACGGGA TCGCCTGGAT GGCGGAAACC
ATGTTCGGCG CGCATATGTC GGAAATTCAG GGCGTGCTGG GCGAAATGGT CAAAGAGTAT
CCGTGGGCCT ACGCCATCGT TCTGCTGCTG GTCTCCAAGT TTGTTAACTC CCAGGCAGCG
GCGCTGGCGG CGATTGTTCC CGTCGCGCTG GCTATCGGTG TCGATCCGGC GTATATCGTG
GCCTCTGCGC CGGCATGTTA TGGCTACTAT ATCCTGCCGA CCTACCCAAG CGATCTGGCG
GCGATTCAGT TTGACCGTTC CGGCACAACC CGTATTGGCC GCTTCGTCAT TAACCACAGC
TTCATTCTGC CGGGTTTGAT TGGCGTGAGC GTCTCCTGCG TCTTTGGCTG GATCTTTGCC
GCAATGTACG GATTCCTGTA A
 
Protein sequence
MLFSIQLLII LICLFYGARK GGIALGLLGG IGLVILVFVF HLQPSKPPVD VMLVIIAVVA 
ASATLQASGG LDVMLQIAEK LLRRNPKYVS IVAPFVTCTL TILCGTGHVV YTILPIIYDV
AIKNNIRPER PMAASSIGAQ MGIIASPVSV AVVSLVAMLG NVTFDGKHLE FLDLLSITIP
STLLGILAIG IFSWFRGKDL DKDEAFQKFI SVPENRQYVY GDTATLLDKK LPKSNWLAMW
IFLAAIAVVA LLGADSDLRP TFGGKPLSMV LVIQMFMLLT GALIIILTKT NPASISKNEV
FRSGMIAIVA VYGIAWMAET MFGAHMSEIQ GVLGEMVKEY PWAYAIVLLL VSKFVNSQAA
ALAAIVPVAL AIGVDPAYIV ASAPACYGYY ILPTYPSDLA AIQFDRSGTT RIGRFVINHS
FILPGLIGVS VSCVFGWIFA AMYGFL