Gene ECD_00069 details

Gene Information

Locus tag: ECD_00069
Symbol: thiP
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 75716
End bp: 77326
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 58%
IMG OID:
Product: thiamin ABC transporter membrane protein
Protein accession: ACT41970
Protein GI: 253976300
COG category:
COG ID:
TIGRFAM ID:
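
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the reported gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(75716, 77326)
print(length, protein_length_aa(length))  # prints: 1611 536
```

Both values agree with the Gene Length and Protein Length fields of this record.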


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAACGC GCCGTCAGCC GTTAATTCCC GGCTGGTTAA TTCCAGGTGT AAGCGCCACC 
ACGCTGGTGG TAGCGGTTGC GCTGGCGGCG TTTCTCGCCC TGTGGTGGAA CGCGCCGCAG
GATGACTGGG TGGCAGTCTG GCAGGACAGC TATCTGTGGC ATGTGGTGCG CTTCTCCTTC
TGGCAGGCGT TTCTCTCGGC GCTGCTCTCT GTCGTACCCG CGATATTCCT CGCCCGCGCG
CTCTATCGCA GGCGCTTTCC GGGTCGGCTG GCGCTGTTGC GTCTGTGCGC AATGACCTTA
ATCCTCCCGG TGCTGGTTGC TGTTTTCGGC ATTCTTAGCG TCTATGGTCG CCAGGGCTGG
CTGGCAACAC TCTGCCAATC GCTCGGTCTG GAGTGGACCT TTTCGCCCTA CGGCCTGCAA
GGTATTTTGC TGGCGCACGT ATTTTTTAAT CTACCGATGG CGAGCCGCTT ATTACTCCAG
GCACTGGAAA ACATTCCCGG CGAACAACGT CAGCTTGCCG CCCAGCTTGG GATGCGCGGC
TGGCATTTTT TCCGCTTCGT CGAATGGCCG TGGTTACGGC GACAAATCCC GCCGGTTGCT
GCACTTATCT TTATGCTCTG TTTCGCCAGC TTCGCCACCG TGCTATCGCT GGGCGGCGGT
CCGCAGGCAA CCACTATCGA GCTGGCAATC TATCAGGCGC TGAGTTACGA CTACGATCCT
GCCCGCGCGG CGATGCTGGC GCTGATCCAG ATGGTGTGTT GCCTCGGGTT GGTGCTGCTG
AGTCAGCGAT TGAGTAAGGC CATTGCGCCA GGCACCACGC TGCTGCAAGG CTGGCGCGAC
CCGGACGATC GTCTGCATAG CCGCATTTGC GACACGGTGT TAATTGTGCT GGCGCTGCTG
CTGTTGCTGC CACCGTTGCT GGCGGTGATC GTCGATGGGC TAAATCGCCA GTTGCCGGAA
GTGCTGGCAC AACCGGTGCT GTGGCAGGCG CTGTGGACCT CGTTGCGTAT TGCGCTGGCG
GCAGGTGTAT TGTGCGTAGT GCTGACCATG ATGCTGCTAT GGAGCAGTCG CGAACTTCGG
GCGCGGCAGA AAATGCTGGC GGGTCAGGCG CTGGAGATGA GCGGCATGTT GATCCTCGCC
ATGCCGGGGA TTGTGCTGGC TACCGGCTTC TTTTTACTGC TCAACAACAC CATTGGCCTG
CCGCAATCTG CTGACGGCAT TGTGATTTTC ACCAATGCGT TAATGGCGAT CCCTTATGCG
CTGAAAGTAC TGGAAAACCC GATGCGCGAT ATCACCGCCC GCTACAGTAT GTTGTGTCAG
TCGCTGGGCA TTGAAGGCTG GTCGCGCTTA AAAGTGGTCG AGCTGCGCGC CCTGAAACGT
CCACTGGCGC AGGCGCTGGC TTTTGCCTGC GTGCTGTCGA TTGGTGATTT TGGCGTGGTG
GCGTTGTTCG GTAACGATGA TTTCCGCACC CTGCCGTTTT ATCTCTACCA GCAAATTGGC
TCCTATCGCA GCCAGGACGG CGCGGTCACC GCGTTAATTC TGCTGCTACT CTGTTTTCTG
CTGTTTACCG TGATTGAAAA ACTACCGGGG CGAGATGTTA AAACTGACTG A
 
Protein sequence
MATRRQPLIP GWLIPGVSAT TLVVAVALAA FLALWWNAPQ DDWVAVWQDS YLWHVVRFSF 
WQAFLSALLS VVPAIFLARA LYRRRFPGRL ALLRLCAMTL ILPVLVAVFG ILSVYGRQGW
LATLCQSLGL EWTFSPYGLQ GILLAHVFFN LPMASRLLLQ ALENIPGEQR QLAAQLGMRG
WHFFRFVEWP WLRRQIPPVA ALIFMLCFAS FATVLSLGGG PQATTIELAI YQALSYDYDP
ARAAMLALIQ MVCCLGLVLL SQRLSKAIAP GTTLLQGWRD PDDRLHSRIC DTVLIVLALL
LLLPPLLAVI VDGLNRQLPE VLAQPVLWQA LWTSLRIALA AGVLCVVLTM MLLWSSRELR
ARQKMLAGQA LEMSGMLILA MPGIVLATGF FLLLNNTIGL PQSADGIVIF TNALMAIPYA
LKVLENPMRD ITARYSMLCQ SLGIEGWSRL KVVELRALKR PLAQALAFAC VLSIGDFGVV
ALFGNDDFRT LPFYLYQQIG SYRSQDGAVT ALILLLLCFL LFTVIEKLPG RDVKTD
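
The two sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a listing might be cleaned, translated, and summarised, assuming the amino-acid assignments of NCBI translation table 11 (identical to the standard code for elongation codons) and an in-frame input. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in T, C, A, G order at each position
# (NCBI table 11 uses the same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Assumption: the input holds only A, C, G, T. Alternative start codons
    (for example GTG or TTG read as Met at initiation) are not handled.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGGCAACGC GCCGTCAGCC GTTAATTCCC"))  # prints: MATRRQPLIP
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to confirm that the two listings match. `gc_fraction` on the full gene sequence can likewise be compared with the GC content field.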