Gene ECD_01508 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_01508 
SymbolstfR 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp1585034 
End bp1586335 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content40% 
IMG OID 
Productpredicted tail fiber protein 
Protein accessionACT43385 
Protein GI253977715 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.574166 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCAGCCGC TTCCCGATGT GTGGATACCG TTTAACGATT CACTGGATAT GCTTGCTGGC 
TTTTCGCCTG GTTATAAGCA AATAACTGTA GGTGATGATG TTATTAAAAT GCCATCCGAT
AAGGTTGTTA GCTTCAAACG CGCATCAGGT GCAACATACA TTAATAAATC AGGAGTATTA
ACCGTTGCTG AAGTTGACGA ACCGCGATTT GAACGAGAAG GTTTGCTGAT TGAAGGACAA
AGAACCAACT ATCATCTTAA TTCACTTACG CCATCTAAGT GGGGAGCTAC AACAAGTGTA
ACTATAACAG AAAGTGGTGT TGATGAGTTT GGCTTTACTT ATGGGCGGTT TCAAATAAAG
GACGAAAAAA TTGGGACAAA TACGACAATG AATATCGCTG CGGTTTCAGG AGGAAGAGGT
GTCGATGTTA CTGGAACTGA AAAGTATGTT ACAACATCAT GTCGTGTAAA AAGCGATAGT
GCTAATATAC AATGTCGTAT AAGATTTGAA AGATATGACG GGTCCGCATA TTTTTATCTG
GCAGATGCAT ATCTTAATAT AACAGATATG TCCATTAGGA AAACGGGAGG AGGGGCTGCA
AGAATAACCG CCCGAGCGGA GAAAGAATCT AATGGATGGA TTTATTTCGA GGTTACATAT
CAATCTGAAG CTATTGATAA TATGGTTGGC TCTCAGATCC AAATTGCTCC ACCTGTTTCA
CCTGGAACTT ATTTGGGCGG GGAATATTTG GATGTTACGA CACCACAATT TGAAGGCGGC
TCATGCGCAT CATCTTTTAT CATTTCCGAT ACAGTTGCAT CAACGCGAGC AAGCGATATT
GTTACATTGC CTTGTAAAAA TAACATGGCC AGCAAACCTT TAACCTGCAT GGTTGAAGTG
AATAAAAATT GGTCTATAGC ACCAAATTCC GCGCCTAGAA TTTATGATAT AACAGGATTT
AAAACAAAAG ACGACGCTTT TGTTTTTGCA TTCAGAAATA CAGCAGGTAG TGTAGGAACT
CCATATGTTC AATTTGGTAA TCCAATATCA TTTCCACCTG GAAATTACCC AAGAAAGATT
ATCGCTGTAT ATAGAATAAA AAGCGATGGC AAGTTTCAGG CTGGCTGCAA TGGGGTTTTA
TCAACACCAG CATCAACAAC GTGGAAGAGT GTTAGTGGTG CTACAGGTAT AAGGATTGGA
GGCCAGACTA CAGCCGGCTT ACGTCATTTA TTTGGTTATA TCAGGAATTT TAGAATATGG
CATAAAGAAT TAACCGATGC GCAAATGGGA GAGATAATAT AA
 
Protein sequence
MQPLPDVWIP FNDSLDMLAG FSPGYKQITV GDDVIKMPSD KVVSFKRASG ATYINKSGVL 
TVAEVDEPRF EREGLLIEGQ RTNYHLNSLT PSKWGATTSV TITESGVDEF GFTYGRFQIK
DEKIGTNTTM NIAAVSGGRG VDVTGTEKYV TTSCRVKSDS ANIQCRIRFE RYDGSAYFYL
ADAYLNITDM SIRKTGGGAA RITARAEKES NGWIYFEVTY QSEAIDNMVG SQIQIAPPVS
PGTYLGGEYL DVTTPQFEGG SCASSFIISD TVASTRASDI VTLPCKNNMA SKPLTCMVEV
NKNWSIAPNS APRIYDITGF KTKDDAFVFA FRNTAGSVGT PYVQFGNPIS FPPGNYPRKI
IAVYRIKSDG KFQAGCNGVL STPASTTWKS VSGATGIRIG GQTTAGLRHL FGYIRNFRIW
HKELTDAQMG EII