Gene Dshi_1749 details

Gene Information

Locus tag: Dshi_1749
Symbol:
ID: 5713316
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 1814843
End bp: 1815970
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 66%
IMG OID: 641267667
Product: carboxylate-amine ligase
Protein accession: YP_001533092
Protein GI: 159044298
COG category: [S] Function unknown
COG ID: [COG2170] Uncharacterized conserved protein
TIGRFAM ID: [TIGR02050] uncharacterized enzyme
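
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS is complete, unspliced, and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of a complete, unspliced CDS.

    Assumes the final codon is a stop codon, which is not translated.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1814843, 1815970)
print(length)                     # 1128
print(protein_length_aa(length))  # 375
```

Both results match the Gene Length and Protein Length fields of this record.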


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.589468
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.393571
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCACCC CAGAGTTCAC CCTTGGCATC GAGGAGGAAT ACCTCCTCGT CGACCGCGAC 
AGCCTGCAAC TTGCCGAGGC GCCCGAGGCC CTCATGGCGG CGTGTCGCGA CAAGCTGGAA
GGCCAGGTCA GTCCCGAGTT CCTACAATGC CAGATCGAGA TCGGCACCGG AGTCTGCGCC
GAAATAGCCG AGGCGCGCGC GGACTTGCGC AAGCTGCGCA GCACGGTCGC GGCGGAGGCT
GCGCGCTTCA ACCTGGCGCC CATCGCGGCC TCCTGCCATC CCAGTGCCGA CTGGGCCGAA
CAGCACCATA CCGACAAGGA CCGCTACAAT GACCTGGAAA AGGACCTGGG CGGGGTCGCG
CGGCGTCTGC TGATCTGCGG GATGCATGTG CATGTGGGGC TTGACGATGA CGACCTGCGG
ATCGACCTCT TGCCGCAGTT TTCCTATTTC CTGCCCCATC TGCTGGCGCT GTCGTCCTCC
TCGCCCTTCT GGAAGGGGCA GGATACGGGG CTGGCCTCCT ACCGGCTGAC GGTGTTCGAC
AACCTGCCCC GGACCGGCCT GCCCCCGGTG TTCAACAGTT GGGCCGAGTA CCAGCGCAAC
ATCCATGTGC TGATCGATCT GGGCCTGATC GAGGACAGCT CGAAGATCTG GTGGGATCTG
CGCCCGTCGC ACAACTTCCC CACGCTGGAG AGCCGGATCT GCGATGTCTG CCCGCGGCTG
GAGGATACCT TGAGCCTGGC CGCCGCGACC CAGGCCCTAA TGCGGATGCT CTGGGGGCTC
AAGACGCACA ATATGCGGTG GCGCGCCTAT GATCGCTTCC TGATTTCGGA GAACCGCTGG
CGGGCGCAAC GCTACGGCGC CCGCGGCAGC CTGATCGATT TCGGCCGCGG CGCGGTGGTC
GACAGTACCG AGCTGGTCGA GGAGTTGATC GAGCTGATCG GTGCCGATGC GAGGGCCCTG
GGCGGGTTGG CGGAGGTCGA GCGCCTGCGC GAGATCGCTG CGGACGGCTC GAGCGCGGAT
CGCCAACGCG CTGTACGCCG CGCGGCCCTG GAGGCCGGGC AGAGCGACGC GGAGGCGATG
AACGCCGTGG TGCGCCACCT GATCGAGGAA TTCCACCGCG ACCTGTGA
 
Protein sequence
MSTPEFTLGI EEEYLLVDRD SLQLAEAPEA LMAACRDKLE GQVSPEFLQC QIEIGTGVCA 
EIAEARADLR KLRSTVAAEA ARFNLAPIAA SCHPSADWAE QHHTDKDRYN DLEKDLGGVA
RRLLICGMHV HVGLDDDDLR IDLLPQFSYF LPHLLALSSS SPFWKGQDTG LASYRLTVFD
NLPRTGLPPV FNSWAEYQRN IHVLIDLGLI EDSSKIWWDL RPSHNFPTLE SRICDVCPRL
EDTLSLAAAT QALMRMLWGL KTHNMRWRAY DRFLISENRW RAQRYGARGS LIDFGRGAVV
DSTELVEELI ELIGADARAL GGLAEVERLR EIAADGSSAD RQRAVRRAAL EAGQSDAEAM
NAVVRHLIEE FHRDL
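
The sequences above are printed in blocks of ten characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of how that could be done and how a GC fraction like the one reported in this record might be computed. The helper names are hypothetical, and only the first row of the gene sequence is used as sample input; reproducing the record's GC content would require pasting in the full sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence above, used here only as a sample.
first_row = "ATGTCCACCC CAGAGTTCAC CCTTGGCATC GAGGAGGAAT ACCTCCTCGT CGACCGCGAC"
seq = clean_sequence(first_row)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC fraction of the sample row only
```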