Gene Dhaf_3559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDhaf_3559 
Symbol 
ID7260577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfitobacterium hafniense DCB-2 
KingdomBacteria 
Replicon accessionNC_011830 
Strand
Start bp3782708 
End bp3783733 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content48% 
IMG OID643563482 
Productaminodeoxychorismate lyase 
Protein accessionYP_002460013 
Protein GI219669578 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000155777 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAGAC GCTGGTTAAA AGGTCTGCTC AGTACTTTAT TTATTATGGC AGTTCTTGCG 
GGAGCAGGAA TCGCAGCTTG GTGGAACTGG GCGAGCCAAC CCTACGCTGA GGAAGGAAGC
AATGCTGCAG AAGTGCAGTT TATGATAACT CCGGGAATGA ATGCCTCTCA AGTTGCTCAG
GAACTGGAAC ATCAGGGGCT TATCCGCAAT GCCCTGGCCT TTCGCTTTTT GGCCAGTCAG
CAAAATGTGG ATTCCAAGCT GCTGGCGGGA GAGTACCAGC TTTCCGCCCA AATGCCCCCC
CAGGAAATGA TTAATAAGAT TCTTGAGGGA CCTGACGTGC ATACTGTAAA GGTCACCATT
CCCGAGGGAT ATACGACAGC CCAGATTATT GATCTATTTG TAAAGAATGA CTTGGGAAGC
AAAGAGGATT ATCAGAGGGT TATTGAAAGC GAGCCTTTCA GTTATTCTTT TCTTGCCGAT
ATCCCTGCGG GACCGAACCG GCTGGATGGT TTTCTTTTTC CTGATACCTA CTTCTTTGCT
CCGGAGGCCG GTCCTAAGGA AAACATCAAT CGAATGCTTA AACGCTTTGA ACAGGAAATA
ACCCCGGAAG TGATGACTAA ATTGGCAGAA ATGAATCTTA CGCTGCGGGA GTGGGTGAAT
CTCGCTTCCA TCGTAGAAAA GGAAGCAGGC AAGGATGCGG ACCGTCCGAT TATCGCCGGA
ATTTTCCTTA ATCGCCTCAA GATCGACATG GCCCTCCAAT CCTGTGCCAC CATTCAATAT
GTACTGGGAA CTCAGAAATA TATCCTCTCT TTAGAAGATA TCCAGGTGGA GTCTCCTTAT
AACACCTATA AGTATCCGGG ATTGCCGCCC AGCCCCATTG CCAGTCCTGG GCATGCCTCT
CTGGATGCAG TGCTCAACAG CACGGATTCC GATTACCTAT ACTTCTTAGC TACTCCAAGT
GGTGAGACGA TTTATGCGAA AACCCATCAG GAGCATTTGC AGAATCAGGC CAAGTATATG
AATTAA
 
Protein sequence
MGRRWLKGLL STLFIMAVLA GAGIAAWWNW ASQPYAEEGS NAAEVQFMIT PGMNASQVAQ 
ELEHQGLIRN ALAFRFLASQ QNVDSKLLAG EYQLSAQMPP QEMINKILEG PDVHTVKVTI
PEGYTTAQII DLFVKNDLGS KEDYQRVIES EPFSYSFLAD IPAGPNRLDG FLFPDTYFFA
PEAGPKENIN RMLKRFEQEI TPEVMTKLAE MNLTLREWVN LASIVEKEAG KDADRPIIAG
IFLNRLKIDM ALQSCATIQY VLGTQKYILS LEDIQVESPY NTYKYPGLPP SPIASPGHAS
LDAVLNSTDS DYLYFLATPS GETIYAKTHQ EHLQNQAKYM N