Gene ECD_10056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_10056 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp788125 
End bp789330 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content58% 
IMG OID 
ProductTail fiber protein 
Protein accessionACT42638 
Protein GI253976968 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGTAA AGATTTCAGG AGTCCTGAAA GACGGCACAG GAAAACCGGT ACAGAACTGC 
ACCATTCAGC TGAAAGCCAG ACGTAACAGC ACCACGGTGG TGGTGAACAC GGTGGGCTCA
GAGAATCCGG ATGAAGCCGG GCGTTACAGC ATGGATGTGG AGTACGGTCA GTACAGTGTC
ATCCTGCAGG TTGACGGTTT TCCACCATCG CACGCCGGGA CCATCACCGT GTATGAAGAT
TCACAACCGG GGACGCTGAA TGATTTTCTC TGTGCCATGA CGGAGGATGA TGCCCGGCCG
GAGGTGCTGC GTCGTCTTGA ACTGATGGTG GAAGAGGTGG CGCGTAACGC GTCCGTGGTG
GCACAGAGTA CGGCAGACGC GAAGAAATCA GCCGGCGATG CCAGTGCATC AGCTGCTCAG
GTCGCGGCCC TTGTGACTGA TGCAACTGAC TCAGCACGCG CCGCCAGCAC GTCCGCCGGA
CAGGCTGCAT CGTCAGCTCA GGAAGCGTCC TCCGGCGCAG AAGCGGCATC AGCAAAGGCC
ACTGAAGCGG AAAAAAGTGC CGCAGCCGCA GAGTCCTCAA AAAACGCGGC GGCCACCAGT
GCCGGTGCGG CGAAAACGTC AGAAACGAAT GCTGCAGCGT CACAACAATC AGCCGCCACG
TCTGCCTCCA CCGCGGCCAC GAAAGCGTCA GAGGCCGCCA CTTCAGCACG AGATGCGGTG
GCCTCAAAAG AGGCAGCAAA ATCATCAGAA ACGAACGCAT CATCAAGTGC CGGTCGTGCA
GCTTCCTCGG CAACGGCGGC AGAAAATTCT GCCAGGGCGG CAAAAACGTC CGAGACGAAT
GCCAGGTCAT CTGAAACAGC AGCGGAACGG AGCGCCTCTG CCGCGGCAGA CGCAAAAACA
GCGGCGGCGG GGAGTGCGTC AACGGCATCC ACGAAGGCGA CAGAGGCTGC GGGAAGTGCG
GTATCAGCAT CGCAGAGCAA AAGTGCGGCA GAAGCGGCGG CAATACGTGC AGAAAATTCG
GCAAAACGTG CAGAAGATAT AGCTTCAGCT GTCGCGCTTG AGGATGCGGA CACAACGAGA
AAGGGGATAG TGCAGCTCAG CAGTGCAACC AACAGCACGT CTGAAACGCT TGCTGCAACG
CCAAAGGCGG TTAAGGTGGT AATGGATGAA ACGAACAGAA AAGCCCACTG GACAGTCCGG
CACTGA
 
Protein sequence
MAVKISGVLK DGTGKPVQNC TIQLKARRNS TTVVVNTVGS ENPDEAGRYS MDVEYGQYSV 
ILQVDGFPPS HAGTITVYED SQPGTLNDFL CAMTEDDARP EVLRRLELMV EEVARNASVV
AQSTADAKKS AGDASASAAQ VAALVTDATD SARAASTSAG QAASSAQEAS SGAEAASAKA
TEAEKSAAAA ESSKNAAATS AGAAKTSETN AAASQQSAAT SASTAATKAS EAATSARDAV
ASKEAAKSSE TNASSSAGRA ASSATAAENS ARAAKTSETN ARSSETAAER SASAAADAKT
AAAGSASTAS TKATEAAGSA VSASQSKSAA EAAAIRAENS AKRAEDIASA VALEDADTTR
KGIVQLSSAT NSTSETLAAT PKAVKVVMDE TNRKAHWTVR H