Gene Dshi_3898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3898 
Symbol 
ID5714427 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009956 
Strand
Start bp121824 
End bp123404 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content65% 
IMG OID641276811 
Productalpha amylase catalytic region 
Protein accessionYP_001542107 
Protein GI159046436 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.237178 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGAC GCGGGCCGTG GCCCGAAAAC CCCGTCATTT ATCAGGTCTA CCCCCGGTCG 
TTCCTTGACA CGACCGGGAC GGGGGAAGGC GATCTGCCGG GGGTGACCCG GCAGCTCGAT
TACATTGCCG GCCTCGGGGT GGACGGCATC TGGCTTTCGC CCTTCTATCC CTCGCCGTTC
TGCGACGGGG GGTATGACAT TGCCGATCAT TGCGCCGTCG ACCGGCGGTT CGGCACCCTC
GACGATTTCG ATGCGCTGGT GGCGCGGGCC CATGATCTGG ACCTGCGTGT GATGATCGAT
CTGGTGCTCA ACCACACGTC GGACACCCAT GACTGGTTCG CAAAATCGCT GGCCCGTGAA
GAAGGCTTCG AGGATGTCTA CATCTGGGCG GACCCGTGCA AGGACGGCAG CCCGCCCTCG
AACTGGCTGT CGTTTTTCGG AGAGGCCGCT TGGCGCTGGC ACCCGCAACG TGCGCAATAC
TGCCTGCACA AGTTTCTGCC CTGTCAGCCC TGCCTGAACC ATTACAACGA CCGCGTGCAC
GAACGGCTGA ACCGGATCAC GCGGTTCTGG CGCGACCGTG GCGTCGATGG CTTCCGCTAT
GACGCGGTAA CGAGCTTTTT CTATGACCCC GGGTTTCGCG ACAATCCCCC CGCGGCCGAG
GCCGAAGCGG CTCTGATCCC CGGGCCATCC AACAATCCAT ATACCTTCCA GGAGCATATT
CACGACGTGC TGCCCAACGA ATGCGCTGCC TTCGCGGAAA CCCTGCGCGA GATGGCAGGC
CCCGACGCCT ACCTTCTGGG GGAGATCAAC AACGGCCCCC GTTCGGTCGA AGTCACGTGC
AAGTTCACCG GCCCCGATCG ACTTGACGCC GGCTATGCGA TCGACTTGCC GGAACGCGGG
CCCAGCACGG AGGTACTGCG CGACCTTCTC ACCCGGCTGG AGGATGCTGA AGGATGGACC
TGGTGGCTCA ACAGCCATGA CCAGAAACGC GCGGTCTCGT CCTTCGGCGA TGGCGGGGCA
GCGGATGCGA AGATGCTCGC AGCGTTCCTT TGCGCGCTGC CCGGCCCCCT CTTGCTGTTT
CAGGGCGAGG AACTGGGGCA GCCACAGGCA GAGCTCGAAA AGGTCGAGCT GACCGATCCT
TATGACCTGA TGTATTGGCC CGACTCGGTG GGTCGCAACG GCGCCCGCGC GCCCATGGCC
TGGGACGACA CGCAGCCCGC ATGCGGCTTC AGCAAAGCGG TGCCGTGGCT ACCTATGGCG
CGGGCGGAAC AGGGCGGCGT GGCACAGCAG GAGGCCGACC CGGCCTCGGT TCTCGCCTTT
TACCGCGATG CACTTGCCCG GCGGCGTGAC CTGGGGCTTG CCGAGGCCAC GATGGAACTC
GAAGACGCGC CTGATGCCTG CATTCGGTTC CGGCTGCGCG TTGGGACGCT CGTTGTGCAG
GTGGCCGCCA ACATGTCCGG CGCGCCACAA GACCTCGCAC CCAGACAGGG TGCAAAACGG
ATCTTGCAGA CCAAGCCCCC CGCGCCGGGC AGCAACCTTC CGCCCCGCAG CGCTGCCTGG
TGGCTGTTGG AGAAAGGCTA G
 
Protein sequence
MPRRGPWPEN PVIYQVYPRS FLDTTGTGEG DLPGVTRQLD YIAGLGVDGI WLSPFYPSPF 
CDGGYDIADH CAVDRRFGTL DDFDALVARA HDLDLRVMID LVLNHTSDTH DWFAKSLARE
EGFEDVYIWA DPCKDGSPPS NWLSFFGEAA WRWHPQRAQY CLHKFLPCQP CLNHYNDRVH
ERLNRITRFW RDRGVDGFRY DAVTSFFYDP GFRDNPPAAE AEAALIPGPS NNPYTFQEHI
HDVLPNECAA FAETLREMAG PDAYLLGEIN NGPRSVEVTC KFTGPDRLDA GYAIDLPERG
PSTEVLRDLL TRLEDAEGWT WWLNSHDQKR AVSSFGDGGA ADAKMLAAFL CALPGPLLLF
QGEELGQPQA ELEKVELTDP YDLMYWPDSV GRNGARAPMA WDDTQPACGF SKAVPWLPMA
RAEQGGVAQQ EADPASVLAF YRDALARRRD LGLAEATMEL EDAPDACIRF RLRVGTLVVQ
VAANMSGAPQ DLAPRQGAKR ILQTKPPAPG SNLPPRSAAW WLLEKG