Gene Dd1591_4031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDd1591_4031 
Symbol 
ID8119630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDickeya zeae Ech1591 
KingdomBacteria 
Replicon accessionNC_012912 
Strand
Start bp4550657 
End bp4552114 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content56% 
IMG OID644854408 
ProductTail Collar domain protein 
Protein accessionYP_003006308 
Protein GI251791587 
COG category[R] General function prediction only 
COG ID[COG5301] Phage-related tail fibre protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACGA AATATACGGC CTTGCTTACC CAGGTGGGAG CAGACAGATT GGCTAATGCG 
ATCGCGCTGG GAAAACAACT GGAAATCGCC CGAATGGGCG TAGGGGATGG TGGTGGTGTA
TTGCCAACGC CGGACGCAAC CCAGACTAAA TTGATCAATG AAAAACGTCG TGCCGCGCTT
AATTCGTTGA GCATCGACCC TGCTAATGCC AATCAGATTA TTGCCGAACA AGTGATCCCG
GAGAATGAAG GTGGATTCTG GCTGCGGGAA ATCGGCTTGT ATGATGCGGA TGATAATTTG
ATTGCCGTAG CCAACTGTCC GGAAACCTAT AAACCACAGA TGCAGGAAGG ATCGGGCCGC
GTGCAGACGG TGCGCATGAT TCTGGCTATT AGTCAGGCGC AAGCGGTATC GCTGAATATC
GACCCGGCGG TAGTGCTGGC AACCCGTAAG TCGGTTGATG ACAAGGCGAT TGAGGTGAAA
GCCTACGCCG ATGAACTGAT GGCGAAACAT CTCGCCGATG CGAATCCGCA TAAGCAGTAT
GCGCCGCTGG CTAGCCCTGC GCTTACCGGG GTGCCGACCG CCCCCACGGC GGCGGCTGGA
ACAAACACCA CGCAGTTGGC TACTACTGCA TTTGTCAAAA ACAATGCGGT GTGGGTATAT
GGGTCTTTAG CTGGGCTGGA TTTGAATACG TTAACTGGCT CCCGCGCTGG GCGGTTCTGG
CAGAATTTAA ATGCAGCAGC GACGGCGGCG CTCAATTACC CAGTTCAGTT TGCTGGCTCG
CTGGATGTTG AAAAGAACAC GGCAGACAGC GCGGAGGGGT GCATTCAACG ATATACAACT
TATGGCGGGG GGGCTCTCCC CCGTATGTTT ATTCGTTCGT ACAATGCGGG GAAACAAGTT
TGGGGGGCAT GGCAGGAGCT GGCCTCATTA TCCAGCCCAA CTTTCACCGG CACGCCGACG
GCGCCAACCG CAGAAGCAGG CTCTAACACT ACACAACTCG CAACGACCGC GTGGTTCGCA
GCAGAGATTG CGGGTATCCC GCTTCCCTGG CCGCAGGCGG CAGTGCCAAC CGGTTGGCTG
AAATGCAACG GTCAGGCATT CGATAAAAAC CGCTATCCAC GGCTGGCGCA GGTCTACCCG
TCGGGCGTGC TGCCGGATCT GCGTGGCGAA TTTATTCGCG GCTGGGATGA TGGGAGGGGG
GTGGATTCGG GGAGAGAAGT GCTCTCGCAG CAGAGAGGCT CTCTAATTAA CTACGATGGT
CCAGATTCAG CACCGACCTC GGACTCGCTA CGGCTGTCAG TATCAGCAGC ACAAGCTGAT
GCCGTCAGTG CGTCAGAGTA TGCCGGAGTG ATGCTGTCGT ACACGGCATA CAACATCACG
ACAGTAAGTG CCGCTGGCTA TGTCGGCGCT ACCCGCCCTC GCAACATTGC CTTTAACTAT
ATCGTGAGAG CAGCATAA
 
Protein sequence
MSTKYTALLT QVGADRLANA IALGKQLEIA RMGVGDGGGV LPTPDATQTK LINEKRRAAL 
NSLSIDPANA NQIIAEQVIP ENEGGFWLRE IGLYDADDNL IAVANCPETY KPQMQEGSGR
VQTVRMILAI SQAQAVSLNI DPAVVLATRK SVDDKAIEVK AYADELMAKH LADANPHKQY
APLASPALTG VPTAPTAAAG TNTTQLATTA FVKNNAVWVY GSLAGLDLNT LTGSRAGRFW
QNLNAAATAA LNYPVQFAGS LDVEKNTADS AEGCIQRYTT YGGGALPRMF IRSYNAGKQV
WGAWQELASL SSPTFTGTPT APTAEAGSNT TQLATTAWFA AEIAGIPLPW PQAAVPTGWL
KCNGQAFDKN RYPRLAQVYP SGVLPDLRGE FIRGWDDGRG VDSGREVLSQ QRGSLINYDG
PDSAPTSDSL RLSVSAAQAD AVSASEYAGV MLSYTAYNIT TVSAAGYVGA TRPRNIAFNY
IVRAA