Gene Dshi_3824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3824 
Symbol 
ID5714353 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009956 
Strand
Start bp30459 
End bp31769 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content66% 
IMG OID641276739 
Productphenylacetate-CoA ligase 
Protein accessionYP_001542035 
Protein GI159046364 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1541] Coenzyme F390 synthetase 
TIGRFAM ID[TIGR02155] phenylacetate-CoA ligase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0827131 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.36821 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGACC TGAGCCCGAA CCGGGCGGAG CTGGACCCGA TCGAGATCGC CAGCCGCGAC 
GAGATTGCGG CGCTCCAGCT CGACCGGATG AAATGGTCCC TGCGCCATGC CTATGACAAT
GTGCCGATGT ATCGCGCCCG GTTCGACGCG GCCGGGGTGC ATCCCGACGA CCTGCGCGAC
TTGAAGGATC TCGCGAAGTT TCCCTTCACC CACAAGAGCG ACCTGCGCGA CCACTATCCT
TTCGGCATGT CGGCGGTGCC GCGCGACAGG CTGGTGCGGG TGCATGCCTC GTCGGGGACC
ACGGGCAAGC CGACGGTGGT GGTCTATACC CGCCACGATA TCGAGGTCTG GGCCGACACC
CTGGCGCGCA GTCTGCGGGC CTCCGGCCTC AGGGCGGGCG ACATGATCCA CAATGCCTAT
GGCTACGGGC TGTTCACCGG GGGTCTGGGC GCCCATTACG GCATCGAGAA GCTGGGCGCG
ACGGTCATTC CCATGGGCGG CGGGCAGACC GAAAAGCAGG TCAGCCTGAT CCATGATTTC
CGGCCGACCG CCATCATGGT GACGCCGTCC TACATGCTCA ATATCCTTGA AGGGTTCCAC
AAGGCGGGTC TGGATCCGCG GCAATCCTCG TTGCAGGTGG GCGTGTTCGG GGCCGAGCCC
TGGACCAACG CCATGCGCCA GGAGGTCGAA GAGGCCTTCG ACATGCACGC GGTGGACATC
TACGGGCTGA GCGAGGTCAT GGGGCCGGGG GTGGCCAATG AATGCGTGGA GACCAAGGAC
GGGCTCCATG TCTGGGAAGA CCATTACTAT CCGGAGATCA TCGACCCGCA GACCGGCGAG
GTGCTCAAGG ACGGTGCGGA GGGCGAGCTG GTCTTCACCA CCCTGACCAA GGAGGGGATG
CCGATGATCC GCTACCGCAC GCGCGATCTG ACCCGGCTTC TGCCCGGCAC GGCGCGCAGC
ATGCGGCGGA TCGAAAAGAT CACCGGGCGC TCCGACGACA TGATGATCCT GCGCGGGGTC
AATGTCTTTC CGACCCAGAT CGAGGAACAG GTGATGGCCA CCGGCGGGCT GGGCCCGTAT
TTCCAGATCG AACTCTACAC CTCGGGGCGG CTGGACGCGA TGCGGGTCTT CGTCGAGGCG
ACCCCGGCGG CGGCGGACGA GCTGTCGAGA ACCGCCGCGG CCCGTGTCCT GACCAAGCAT
GTCCGGGACA TGGTCGGGGT GTCCATCGAA GTGGTCGTCG GCGACCCCGG CTCGGTCGCG
CGCAGCCAGG GCAAGGCCGT CCGCGTCATC GACAACCGCA AAAAGGACTA G
 
Protein sequence
MKDLSPNRAE LDPIEIASRD EIAALQLDRM KWSLRHAYDN VPMYRARFDA AGVHPDDLRD 
LKDLAKFPFT HKSDLRDHYP FGMSAVPRDR LVRVHASSGT TGKPTVVVYT RHDIEVWADT
LARSLRASGL RAGDMIHNAY GYGLFTGGLG AHYGIEKLGA TVIPMGGGQT EKQVSLIHDF
RPTAIMVTPS YMLNILEGFH KAGLDPRQSS LQVGVFGAEP WTNAMRQEVE EAFDMHAVDI
YGLSEVMGPG VANECVETKD GLHVWEDHYY PEIIDPQTGE VLKDGAEGEL VFTTLTKEGM
PMIRYRTRDL TRLLPGTARS MRRIEKITGR SDDMMILRGV NVFPTQIEEQ VMATGGLGPY
FQIELYTSGR LDAMRVFVEA TPAAADELSR TAAARVLTKH VRDMVGVSIE VVVGDPGSVA
RSQGKAVRVI DNRKKD