Gene ECD_00043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_00043 
SymbolcaiA 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp43438 
End bp44580 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content54% 
IMG OID 
Productcrotonobetaine reductase subunit II, FAD-binding 
Protein accessionACT41944 
Protein GI253976274 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTTTA ATTTAAATGA TGAGCAGGAA CTGTTTGTCG CCGGTATCCG CGAACTGATG 
GCCAGCGAAA ACTGGGAGGC CTATTTTGCC GAGTGCGACC GTGACAGCGT CTACCCGGAA
CGTTTTGTCA AAGCACTGGC GGATATGGGT ATCGACAGTC TGCTGATCCC TGAAGAGCAC
GGTGGTCTGG ACGCGGGGTT TGTTACTCTC GCCGCCGTGT GGATGGAGCT GGGACGTCTG
GGGGCACCAA CCTATGTGCT GTACCAGTTG CCGGGCGGGT TCAACACCTT CCTGCGCGAA
GGCACACAAG AGCAGATCGA CAAAATTATG GCTTTCCGCG GCACCGGTAA GCAGATGTGG
AACTCAGCGA TTACCGAACC GGGCGCGGGC TCCGACGTGG GTAGCCTGAA AACGACTTAT
ACCCGTAGAA ATGGTAAGAT TTATCTTAAT GGTAGTAAGT GTTTTATTAC CAGCAGCGCC
TACACCCCGT ACATCGTGGT GATGGCGCGC GACGGGGCTT CTCCGGACAA ACCTGTCTAC
ACCGAATGGT TTGTTGATAT GAGCAAACCG GGCATCAAAG TGACCAAACT TGAGAAGCTC
GGTCTGCGTA TGGATAGCTG CTGTGAAATC ACCTTTGACG ACGTGGAACT GGACGAGAAA
GACATGTTCG GTCGGGAAGG TAACGGCTTT AACCGCGTCA AAGAAGAGTT CGACCATGAA
CGTTTCCTGG TAGCCCTCAC CAACTACGGT ACGGCGATGT GCGCCTTTGA AGATGCGGCG
CGCTACGCCA ATCAGCGCGT GCAGTTTGGC GAGGCTATTG GTCGTTTCCA GTTGATTCAG
GAAAAATTCG CCCACATGGC GATCAAATTA AACTCCATGA AAAACATGCT GTATGAAGCA
GCGTGGAAAG CAGACAACGG CACCATCACC TCTGGCGATG CAGCGATGTG CAAATACTTC
TGCGCCAATG CGGCATTTGA AGTGGTGGAT AGCGCAATGC AGGTGCTGGG CGGTGTCGGG
ATTGCGGGCA ACCACCGCAT CAGCCGCTTC TGGCGTGACC TGCGTGTAGA CCGCGTTTCC
GGAGGATCTG ACGAAATGCA GATCCTGACG CTGGGTCGTG CGGTGCTGAA GCAATACCGC
TAA
 
Protein sequence
MDFNLNDEQE LFVAGIRELM ASENWEAYFA ECDRDSVYPE RFVKALADMG IDSLLIPEEH 
GGLDAGFVTL AAVWMELGRL GAPTYVLYQL PGGFNTFLRE GTQEQIDKIM AFRGTGKQMW
NSAITEPGAG SDVGSLKTTY TRRNGKIYLN GSKCFITSSA YTPYIVVMAR DGASPDKPVY
TEWFVDMSKP GIKVTKLEKL GLRMDSCCEI TFDDVELDEK DMFGREGNGF NRVKEEFDHE
RFLVALTNYG TAMCAFEDAA RYANQRVQFG EAIGRFQLIQ EKFAHMAIKL NSMKNMLYEA
AWKADNGTIT SGDAAMCKYF CANAAFEVVD SAMQVLGGVG IAGNHRISRF WRDLRVDRVS
GGSDEMQILT LGRAVLKQYR