Gene TM1040_1300 details


Gene Information

Locus tag: TM1040_1300
Symbol:
ID: 4078499
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1392128
End bp: 1393741
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 59%
IMG OID: 638006608
Product: phage portal protein, lambda
Protein accession: YP_613295
Protein GI: 99081141
COG category: [R] General function prediction only
COG ID: [COG5511] Bacteriophage capsid protein
TIGRFAM ID: [TIGR01539] phage portal protein, lambda family
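
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent if the coordinates are read as 1-based and inclusive, and if the CDS ends in a stop codon that is not counted in the protein length. Both readings are assumptions (the record does not state its coordinate convention), and the function names below are illustrative only. A minimal sketch of the arithmetic:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1392128, 1393741)
print(length, protein_length(length))  # 1614 537
```

The printed values match the Gene Length (1614 bp) and Protein Length (537 aa) fields of this record.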


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.616273
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCTGGC GACTGTTCAG ACGCGAAGAT CCTGTTGCGC CGGAGCCGGA TCGCAAGCCC 
CCTCCAATGC TGACGGCACC CCGAAGGCGT GGCCAGCGCA TGTTTGCCGC CGCAGAGACA
GATCGCATGA CAAGCGGTTG GACCAATTCA CCAATGCCAG CGGATCAAAT CATCCGCCGC
AATTGGCGCG TGCTGGTGGC ACGTTCGCGC GAGCAGTCGG CGAACAACGA CTATGCAAAG
GCGTTCAAGG CCAGCGCACG GCGAAACCTG ATTGGACAGA AGGGGTTCAC ACTGCAGGCG
CAGGCGTTTG ACGGTGACAA GCTGGACGCA CAGGCAAACA AGGCCATCGA ACGCGCATGG
CGCGCCTGGT GCAAGGCGAT GAATTGCGAC GTCAAAGGGC GTCGCACCTT ACGGCAGATC
CAGAAAACGA TTGTGAACGG CCTTTGCACC GATGGCGAGT TTATGGTGCG CATGGTGGTT
GGGCGGGATG CGGGCCCGTG GGGATTTGCA TTGCAGATCC TCGACCCGGT GTTGTGCCCG
GTCGATTTTG ATGAGGATCG TCGCCCCGGT GGTGGGTTCA TTCGGGCAGG GATCGAATAT
ACGAAGATGG GCCGACCCGT GGCCTATTAT TTCACCACCC TCGATCAATC GCAGGCCGAT
TATCACTATT CCGGGCGGGC GTTCATCCGG GTTCCGGCGG ATGAAATTAT CCACTGGTTT
GAAGAGGATT TTGTCGGGCA GAAACGCGGG CTGCCGTGGA TGGCGACCGC GCTCTTGCGT
ATGCGCCAAC TTGGCGAGTT CGAGAAAAGC GCTCTGAACA ACGCTCGTGA AGGCGCGAAC
AAGGTCGGCG TGATCGAATG GGACGAAGGG TTCGGCCCTG AGCCAGAAGA GGACGACGCC
ACCAAGAGCG ACGATGATCT AGGCTTTGAG GACATCGAAC TCGATAGTGA AATGGGAGTT
TATCATCAGC TCCCGATGGG TGCGCGTCTA AAGCGGGTCG AAACCGGATA TCCAAACGGC
GAAATGGCGG TGTTTTCAAA GCACATGCTG CGCGGGGTCG CGACAGGCTT GGGCGTTGCC
TACAACGATC TTGCCAATGA CCTTGAGGGC GTGAATCTAT CGAGCATCCG CCACGGCGTT
TTGAGTGAGC GGGACCAGTG GATTGAGTTG CAAGAGAGCC TGATCGAGGC CTTTGCCTTG
CCGATCTATG AGCGTTGGCT CGAATACTCG CTGCTGAAAC AGAAAATCAC CCTCGACAAC
GGATCGCCGC TGCCAGCGAG CAAGCGGTCG AAGTTCATGG CGGTGACCTT CCAGGCACGG
CGCTGGCAGT GGATTGATCC TGCAAAAGAC GTGACGGCCG ACGCCGACGC CGTCGACAAC
CTGTTCAAGT CGCGCGGTCA GGTGATCCGC GAGCGCGGGC GCGACCCGCG CGAGGTCTAT
GCCGAGGTCG CAGAAGATAT CGCAGCGATG CGCGAGGCGA AAATCCCCGA CAACGTGATC
GAGGCCTTGA TCACAGCCAA ATCAAAAGGA GGGCAGGGCA GTGGACAGCC AGCCAAAACC
GGCACCGGAG AAACCGACCC AGACGCCGAC CCAGACCCCG ACAAAAGCGA ATGA
 
Protein sequence
MIWRLFRRED PVAPEPDRKP PPMLTAPRRR GQRMFAAAET DRMTSGWTNS PMPADQIIRR 
NWRVLVARSR EQSANNDYAK AFKASARRNL IGQKGFTLQA QAFDGDKLDA QANKAIERAW
RAWCKAMNCD VKGRRTLRQI QKTIVNGLCT DGEFMVRMVV GRDAGPWGFA LQILDPVLCP
VDFDEDRRPG GGFIRAGIEY TKMGRPVAYY FTTLDQSQAD YHYSGRAFIR VPADEIIHWF
EEDFVGQKRG LPWMATALLR MRQLGEFEKS ALNNAREGAN KVGVIEWDEG FGPEPEEDDA
TKSDDDLGFE DIELDSEMGV YHQLPMGARL KRVETGYPNG EMAVFSKHML RGVATGLGVA
YNDLANDLEG VNLSSIRHGV LSERDQWIEL QESLIEAFAL PIYERWLEYS LLKQKITLDN
GSPLPASKRS KFMAVTFQAR RWQWIDPAKD VTADADAVDN LFKSRGQVIR ERGRDPREVY
AEVAEDIAAM REAKIPDNVI EALITAKSKG GQGSGQPAKT GTGETDPDAD PDPDKSE
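
The sequences above are laid out in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC fraction from the result. The helper names `clean_sequence` and `gc_content` are hypothetical, and the record does not say how its own GC content figure was calculated, so this is not necessarily the method behind the reported 59%.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence, used only as a short demonstration.
first_blocks = "ATGATCTGGC GACTGTTCAG"
print(clean_sequence(first_blocks))       # ATGATCTGGCGACTGTTCAG
print(f"{gc_content(first_blocks):.0%}")  # 50% for these 20 bases, not the whole gene
```

Passing the full gene sequence text to `gc_content` gives the value for the whole CDS.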