Gene Dd1591_1823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDd1591_1823 
Symbol 
ID8117306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDickeya zeae Ech1591 
KingdomBacteria 
Replicon accessionNC_012912 
Strand
Start bp2084754 
End bp2085959 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content55% 
IMG OID644852215 
ProductTail Collar domain protein 
Protein accessionYP_003004153 
Protein GI251789432 
COG category[R] General function prediction only 
COG ID[COG5301] Phage-related tail fibre protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.112044 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTACAA AATACTTTGC TTTACTGACG AATACCGGCG CGGCGAAATT AGCTAACGCC 
ACGGCGCTGG GTAGCCACCT GGCCATCACG CAAATGGCGG TCGGCGATGG CGGCGGCAGC
CTGCCTACTC CGACGCCGGC CCAAACCAAA CTGATCAATG AGAAACGCCG CGCTACGCTC
AACGCACTGA GTGTTGACCC TAAAAACCCC AACCAAATCA TTGCCGAGCA GGTTATTCCT
GAAAATGAGG GCGGTTGGTG GATCCGTGAA ATCGGTTTGT ATGACAGCGA CGGTGATCTG
GTTGCCGTCG CAAACTGCGC CGATACCTAT AAGCCACTGC TACAGGAAGG CTCAGGCCGG
GTACAAACCG TACGCATGAT CCTGATTGTC AATAGCGCCG ACGCCGTTAC CCTGAAAATC
GACCCAGCTG TGGTACTGGC GACCCGCCAG TATGTCGATG ATACGGCTGT TGAGGTCAAG
ACATACGCCG ACAGCCAACT CAACGCACAT GTCGCCGCCG CCAATCCCCA CCCGCAATAT
GCTCCACTCA GCAGCCCCGC GCTGACCGGC GTACCTACCG CGCCGACGGC GGCCAACAGC
GCGAACTCCA CGCAACTGGC GACCACCGCA TTTGTCAAAA ACACGGCATT GCTTAAAGAA
CAAAACGGTG CGGATATCGC GAATAAATCG GCCTTTCTCG CAAACCTGGG TTTAAGCGAC
ACGCTGAAAA TCGCGGATAT CGTCGGCATT CCGTTACCCT GGCCGCAAGC CACACCGCCC
GCCGGCTGGC TGAAATGCAA CGGTCAGGCG TTCGACAAAA ACGCCTTCCC GAAACTGGCG
CAGGCTTACC CCGGCGGCGT GCTGCCGGAT CTGCGCGGTG AATTTATTCG CGGCTGGGAT
GATGGGCGCG GGGTGGATGT TGCCCGTGAA TTGCTAAGTT GGCAAAAAGG AACTCTGACT
ATTTCAGATC CCAACTTAAG TGCAGTAAAT GTAGGGGCAT TGATACATGC AAACAACGAT
AGTGCCAATA CATACAAATC TATGGGGTTT GATATTGTCA ACAAAAGCGA TTATGCCATG
CTACGTGCCG CTATAAATGT GGAGACGGTG GGGGCACAAG ATTTGGATTC AAATGGTTGG
CAGTTTGGTT ATGGAGCTAC CCGCCCCCGC AATATCGCCT TTAACTACAT CGTCAGAGCA
GCCTGA
 
Protein sequence
MSTKYFALLT NTGAAKLANA TALGSHLAIT QMAVGDGGGS LPTPTPAQTK LINEKRRATL 
NALSVDPKNP NQIIAEQVIP ENEGGWWIRE IGLYDSDGDL VAVANCADTY KPLLQEGSGR
VQTVRMILIV NSADAVTLKI DPAVVLATRQ YVDDTAVEVK TYADSQLNAH VAAANPHPQY
APLSSPALTG VPTAPTAANS ANSTQLATTA FVKNTALLKE QNGADIANKS AFLANLGLSD
TLKIADIVGI PLPWPQATPP AGWLKCNGQA FDKNAFPKLA QAYPGGVLPD LRGEFIRGWD
DGRGVDVARE LLSWQKGTLT ISDPNLSAVN VGALIHANND SANTYKSMGF DIVNKSDYAM
LRAAINVETV GAQDLDSNGW QFGYGATRPR NIAFNYIVRA A