Gene Daci_4099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaci_4099 
Symbol 
ID5749686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDelftia acidovorans SPH-1 
KingdomBacteria 
Replicon accessionNC_010002 
Strand
Start bp4505638 
End bp4506882 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content64% 
IMG OID641299201 
ProductPBSX family phage terminase large subunit 
Protein accessionYP_001565115 
Protein GI160899533 
COG category[R] General function prediction only 
COG ID[COG1783] Phage terminase large subunit 
TIGRFAM ID[TIGR01547] phage terminase, large subunit, PBSX family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000271787 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCAAGG CAAGGATTGA ACTGCCGCCC AAGCTGATCC CGGTGTTCAG TGGTGAGGCG 
CGATACCGGT GCGCGCACGG TGGCCGGGGA TCTGCAAAGA CCCGCAGCTT CGCGATGATG
ACGGCGGTCC GCGCGTACAT GTTCGCCGAG GCCGGCGTGT CTGGGGTGAT CCTTGGCGCG
CGGGAATACA TGAACAGCCT GAGCGAGTCC TCGATGGAGG AGATCAAGCA GGCCATTCGC
TCGGTGCCTT GGCTTGATGC CTACTTCGAG ATCGGTGAGC AGTACATCCG CACGAAGAAT
CGCCGCGTCT CCTACGTGTT CGCCGGCCTG CGCCACAACC TGGACAGCAT CAAGTCCAAG
GCGCGCATCC TGATCGCCTG GGTGGACGAG GCCGAAAGCG TCAGCGAGGT GGCTTGGCAG
AAGCTGGCCC CCACGGTGCG CGAGCAGGGC TCCGAGATCT GGGTGACATG GAACCCGGAG
AAGGACGGCA GCCCCACGGA CAAGCGATTT CGCAAGGAGC CGCCACCGAA TTCGAAGGTT
GTCGAGTTGA ATTACTCGGA CAACCCCTGG TTTCCCGAGG TGCTCGACCA AGAGCGCCAG
GCCGACCGCG ACCGGCTGGA CGACCAGACC TACGCCTGGG TATGGGATGG CGCGTACCGC
GAGAACAGCG ATGCGCAGAT CCTGTCTGGC AAGTACCGCG TGGCCGAGTT CGAGCCTCAG
CCGGGCTGGG ATGGACCGTA CTTCGGCCTG GACTGGGGAT TCTCGCAAGA CCCGACCGCC
GGCGTGAAGT GCTGGGTCGG GGATGGCCGG CTCTGGATCG AATACGAGGC CGGCAAGGTC
GGCCTGGAGA ACGACGACAT CGCCGACTAC GTGATCCAGC GGCTGCCGGG CATCGAGCAG
CACACGGTGC GCGCCGACTC CGCGCGCCCC GAGACGATCA GCCACGTCAA GAGCAAGGGC
CGCGACGGCA AGCGCCAGTG CCTGCCCAAG CTCGAAGCCG TGGAGAAGTG GAAGGGCAGC
GTAGAGGACG GCATCGCCCA CCTGCGCGGC TACAAGGAAA TCGTCATCCA TGAGCGCTGC
ACTCAGGTGC TGCGCGAGGC CCGGCTCTAC AGCTACAAGG TGGACCGCAA GAGCGGCGAC
GTGCTCACGG ACATCGTGGA CGCGAACAAC CACTACATCG ACGCATTGCG TTATGCGCTT
GGCCCGCTGA TCAAGCGCGC TGGGTATTCC TGGAGAGGCT TCTGA
 
Protein sequence
MSKARIELPP KLIPVFSGEA RYRCAHGGRG SAKTRSFAMM TAVRAYMFAE AGVSGVILGA 
REYMNSLSES SMEEIKQAIR SVPWLDAYFE IGEQYIRTKN RRVSYVFAGL RHNLDSIKSK
ARILIAWVDE AESVSEVAWQ KLAPTVREQG SEIWVTWNPE KDGSPTDKRF RKEPPPNSKV
VELNYSDNPW FPEVLDQERQ ADRDRLDDQT YAWVWDGAYR ENSDAQILSG KYRVAEFEPQ
PGWDGPYFGL DWGFSQDPTA GVKCWVGDGR LWIEYEAGKV GLENDDIADY VIQRLPGIEQ
HTVRADSARP ETISHVKSKG RDGKRQCLPK LEAVEKWKGS VEDGIAHLRG YKEIVIHERC
TQVLREARLY SYKVDRKSGD VLTDIVDANN HYIDALRYAL GPLIKRAGYS WRGF