Gene Dd1591_4133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDd1591_4133 
Symbol 
ID8120985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDickeya zeae Ech1591 
KingdomBacteria 
Replicon accessionNC_012912 
Strand
Start bp4662045 
End bp4663325 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content60% 
IMG OID644854508 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003006408 
Protein GI251791687 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGACCG TACGCAAAGG GCTGTGGCGT GTGGGCGGTG CGCTGTGCGT GCTGTTGCTG 
ATCACACCGG CACAGGCCGT CACTGAGGTG AACTTCTGGT ATTCCGGCGG CACTAAACCG
CAAAAGATGA TGCTGACGCT GATCGACGAA TTTAACCGTA GTCAGGATCA ATACGTGGTG
AAAAGCGCAT TACAGGGTAA TTACGATGAA ACCTGGCAAA AATTGCAGGC CGGCATGGCG
GCGAAAAACG CCCCGGCGTT CGCGCTGTTG ACCGCCTTGC AGGGAACGGC GTTGGCGGAG
CGCAAACTGT TGCGCGATAT GCGCCCTTAC ATGGATAGCC GCTTCCGTTT CACCGATTTT
ATCGGTGCGT TCCGCCGTCA GGTTACCCGG CCGGACGGCA GCGTTTACGG CTTGCCTGCC
TACGGCACAA CGCAGGTGCT TTACTACAAC CAGGCGGTGC TGGCGCAGCA CGGCTTTACG
CCGGATGACC TGAAAACCTG GCAAGGACTG GCGAACGTGG CGGCGGCGGT CACCCGGAAA
GACGCCGGCG GCAACACCCA GTATTACGGT TGGGAGCCGA TGTGGGGGCC GGGCAATCTG
ATGGACGCCG CGTTGTCCAA CGGCGGGCGC ATCCTCAGCG AAGACGGCAA AAAAGTGCTG
ATCGATTCGC CGGCATGGGT GGAGGTGTGG GACAGCTTCC GCCGCTGGAT TCATGAAGAC
CGTATCATGC GTATCTACCA CGGCGGCCAG GGCTGGGAGT ACTGGTATAA AACCATCGAC
GATGTGATGA AAGATCGGGC GTTCGGGTAT ACCGGCTCAT CCGGCGATCA GGGCGATCTT
GATTTTCAGC GGCTGGCGGC GTTGCCTCAA CCGAGTTGGG GCAACAATCC GGCGGCACCG
CAGGCGGGCG CGCTGGTATT TGTGATGCCG ACCGACACGC CGGACGCGGC GGCGAAGGGC
GCTTTTGCGT TTATGCGTTT TTACACCAGC GCCGCCAATA CCGCCCGCTG GTCAATGTTC
ACGGGTTATA TTCCGGTACG CGAAAGCGTA TTGCAGGACG CGGGCTACCA GAAATACGTC
GCCGCCAACC CGCAGGCCGC CGTGCCGGTG CAGCAGGCGA AATCCGCCAG TATGGATTTT
ATCGACCCGA CCAAAGGCAA AATTCTGGAC GCGCTGACCG TTGCCGCCGA TCAGGTGGAA
ATCGAGAACA AACCGGCGCA GCAGGCATTG ACCGACGCGG CCCGTAAAGC GCAGAAAGCG
CTGGATCGCA TCAATGAATA A
 
Protein sequence
MMTVRKGLWR VGGALCVLLL ITPAQAVTEV NFWYSGGTKP QKMMLTLIDE FNRSQDQYVV 
KSALQGNYDE TWQKLQAGMA AKNAPAFALL TALQGTALAE RKLLRDMRPY MDSRFRFTDF
IGAFRRQVTR PDGSVYGLPA YGTTQVLYYN QAVLAQHGFT PDDLKTWQGL ANVAAAVTRK
DAGGNTQYYG WEPMWGPGNL MDAALSNGGR ILSEDGKKVL IDSPAWVEVW DSFRRWIHED
RIMRIYHGGQ GWEYWYKTID DVMKDRAFGY TGSSGDQGDL DFQRLAALPQ PSWGNNPAAP
QAGALVFVMP TDTPDAAAKG AFAFMRFYTS AANTARWSMF TGYIPVRESV LQDAGYQKYV
AANPQAAVPV QQAKSASMDF IDPTKGKILD ALTVAADQVE IENKPAQQAL TDAARKAQKA
LDRINE