Gene EcolC_3001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3001 
Symbol 
ID6065940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3280686 
End bp3281663 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content47% 
IMG OID641602418 
ProductSel1 domain-containing protein 
Protein accessionYP_001725953 
Protein GI170020999 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTTCA CGTCAAGTTG CTGCGATAAT TTATCAATAG ATGAGATTAT CGAACGTGCT 
GAAAAAGGCG ATTGCGAGGC TCAATATATT GTTGGGTTTT ATTATAATCG CGATAGCGCA
ATTGATTCTC CGGACGACGA AAAAGCCTTT TACTGGCTGA AGCTGGCCGC TGAGCAAGGT
CATTGTGAAG CACAGTATTC CTTAGGGCAA AAGTATACCG AGGATAAAAG CCGTCATAAA
GATAATGAGC AAGCCATCTT CTGGCTGAAA AAAGCTGCCC TACAAGGCCA TACTTTCGCT
TCCAACGCCC TTGGCTGGAC ACTGGATCGT GGAGAAGCCC CCAATTATAA AGAAGCGGTT
GTCTGGTATC AGATAGCCGC GGAGAGCGGA ATGTCTTATG CGCAAAATAA TCTTGGGTGG
ATGTACAGAA ATGGCAACGG AGTCGCAAAA GACTATGCGC TGGCATTTTT CTGGTACAAA
CAAGCTGCAT TACAAGGCCA TAGTGACGCG CAAAACAATC TGGCCGATCT TTATGAAGAC
GGAAAAGGCG TTGCTCAAAA CAAGACACTC GCCGCATTCT GGTATTTGAA AAGCGCACAG
CAGGGTAATC GGCACGCCCA GTTTCAAATT GCATGGGATT ATAACGCTGG CGAAGGGGTG
GACCAGGACT ATAAGCAAGC GATGTACTGG TATCTGAAGG CTGCCGCTCA GGGGAGCGTC
GGCGCTTACG TCAACATCGG TTATATGTAT AAACACGGAC AAGGCGTTGA GAAAGATTAT
CAGGCTGCCT TTGAATGGTT TACGAAAGCC GCTGAATGCA ATGACGCCAC TGCCTGGTAT
AACCTGGCCA TTATGTATCA CTACGGAGAA GGAAGACCTG TCGATCTCCG ACAGGCTCTC
GACCTGTATC GTAAAGTTCA GTCATCCGGA ACCAGAGATG TCAGTCAGGA AATTCGTGAG
ACTGAAGATT TACTGTAG
 
Protein sequence
MIFTSSCCDN LSIDEIIERA EKGDCEAQYI VGFYYNRDSA IDSPDDEKAF YWLKLAAEQG 
HCEAQYSLGQ KYTEDKSRHK DNEQAIFWLK KAALQGHTFA SNALGWTLDR GEAPNYKEAV
VWYQIAAESG MSYAQNNLGW MYRNGNGVAK DYALAFFWYK QAALQGHSDA QNNLADLYED
GKGVAQNKTL AAFWYLKSAQ QGNRHAQFQI AWDYNAGEGV DQDYKQAMYW YLKAAAQGSV
GAYVNIGYMY KHGQGVEKDY QAAFEWFTKA AECNDATAWY NLAIMYHYGE GRPVDLRQAL
DLYRKVQSSG TRDVSQEIRE TEDLL