Gene ECD_03994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_03994 
SymboldcuB 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp4254484 
End bp4255824 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content52% 
IMG OID 
ProductC4-dicarboxylate antiporter 
Protein accessionACT45784 
Protein GI253980114 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTATTTA CTATCCAACT TATCATAATA CTGATATGTC TGTTTTATGG TGCCAGAAAG 
GGTGGTATCG CGCTGGGTTT ATTAGGCGGT ATCGGTCTGG TCATTCTGGT CTTCGTCTTC
CACCTTCAGC CAGGTAAACC ACCAGTTGAT GTCATGCTGG TTATCATTGC GGTGGTGGCG
GCATCGGCAA CCTTGCAAGC TTCGGGCGGT CTTGATGTCA TGCTGCAAAT TGCCGAGAAG
CTGCTGCGCC GCAACCCGAA ATATGTCTCA ATTGTCGCGC CGTTTGTGAC CTGTACGCTG
ACCATTCTTT GCGGTACGGG TCATGTGGTT TACACCATTC TGCCGATCAT CTACGACGTC
GCCATTAAGA ACAACATCCG TCCGGAACGT CCGATGGCGG CAAGTTCTAT CGGTGCACAG
ATGGGGATTA TCGCCAGTCC GGTGTCGGTT GCGGTCGTGT CTCTGGTTGC AATGCTGGGT
AATGTCACCT TTGATGGTCG CCATCTTGAG TTCCTCGACC TGCTGGCAAT CACCATTCCA
TCGACGTTAA TCGGTATCCT GGCGATCGGT ATCTTCAGCT GGTTCCGCGG TAAAGATCTG
GATAAAGACG AAGAGTTCCA GAAATTCATC TCCGTACCGG AAAACCGTGA GTATGTTTAC
GGTGATACCG CGACGCTGCT CGATAAAAAA CTGCCGAAAA GCAACTGGCT GGCAATGTGG
ATTTTCCTCG GGGCAATCGC TGTAGTCGCA CTTCTTGGTG CTGATTCGGA CCTGCGTCCA
TCCTTCGGCG GCAAACCGCT GTCGATGGTA CTGGTTATTC AGATGTTTAT GCTGCTGACC
GGGGCGCTGA TTATTATCCT GACCAAAACC AATCCCGCGT CTATCTCAAA AAACGAAGTC
TTCCGTTCCG GTATGATCGC CATCGTGGCG GTGTACGGTA TCGCATGGAT GGCAGAAACC
ATGTTCGGTG CGCATATGTC TGAAATTCAG GGCGTACTGG GTGAAATGGT GAAAGAGTAT
CCGTGGGCCT ATGCCATTGT TCTGCTGCTG GTTTCCAAGT TTGTAAACTC TCAGGCTGCG
GCGCTGGCGG CGATTGTTCC GGTCGCGCTG GCGATCGGCG TTGATCCGGC ATACATCGTG
GCTTCAGCAC CGGCTTGCTA CGGTTATTAC ATCCTGCCGA CTTATCCGAG CGATCTGGCA
GCGATTCAGT TTGACCGTTC CGGCACCACC CACATCGGTC GCTTCGTCAT CAACCACAGC
TTTATTCTGC CGGGGTTGAT TGGTGTGAGC GTATCGTGCG TCTTCGGCTG GATCTTCGCC
GCGATGTACG GGTTCTTATA A
 
Protein sequence
MLFTIQLIII LICLFYGARK GGIALGLLGG IGLVILVFVF HLQPGKPPVD VMLVIIAVVA 
ASATLQASGG LDVMLQIAEK LLRRNPKYVS IVAPFVTCTL TILCGTGHVV YTILPIIYDV
AIKNNIRPER PMAASSIGAQ MGIIASPVSV AVVSLVAMLG NVTFDGRHLE FLDLLAITIP
STLIGILAIG IFSWFRGKDL DKDEEFQKFI SVPENREYVY GDTATLLDKK LPKSNWLAMW
IFLGAIAVVA LLGADSDLRP SFGGKPLSMV LVIQMFMLLT GALIIILTKT NPASISKNEV
FRSGMIAIVA VYGIAWMAET MFGAHMSEIQ GVLGEMVKEY PWAYAIVLLL VSKFVNSQAA
ALAAIVPVAL AIGVDPAYIV ASAPACYGYY ILPTYPSDLA AIQFDRSGTT HIGRFVINHS
FILPGLIGVS VSCVFGWIFA AMYGFL