Gene TM1040_1622 details

Gene Information

Locus tag: TM1040_1622
Symbol:
ID: 4077724
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1728820
End bp: 1730139
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 62%
IMG OID: 638006935
Product: hypothetical protein
Protein accession: YP_613617
Protein GI: 99081463
COG category:
COG ID:
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
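
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1728820
end_bp = 1730139
gene_length_bp = 1320
protein_length_aa = 439

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons (3 bp each).
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and contributes no amino acid.
expected_protein_aa = codons - 1

print(span_bp, codons, remainder, expected_protein_aa)  # prints: 1320 440 0 439
```

Under these assumptions the reported gene length (1320 bp) and protein length (439 aa) agree with the listed coordinates.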


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.264122
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATAAGA TCCTGGAACT GCGAGCCCGC CGCGCGGGTA TCATCGACCG TATGGACGCG 
CTGGTTGCGT CAATCGGCGA TGGGGAGGAA TGGACAGAGG ATCAGACTGC CCAATTCGAT
GCCCTGAAGG CCGAAGATGA TAAGGTGACG GCAGAGCTCA CCCGCCTTGA AGATGTGGAG
CGCCGCCGCG CTGAGGCCGC GCGTCCGGCC GCGCCGCTGC CTGGTGCTGC GGGCACCGAA
GCTGGTGGGG TACCGACTGC ACCCGCAGCC CCGAAGGAGC CAGGCCTGCA GTTCGCTCGC
ATGGTGCGCA CGATCGCGGC GGCGGGCGGC AACCAGTATG TTGCACAGCA GATCGCCGAG
GCGAGCGGAG ACAGCGGTCT TTTTGCCAGC CAGAACATGT CCACCGGCAC CGCCGGCGGA
TTTCTGGTGC CGGAAGATGT GTCCAGTGAG GTGATCGAGC TGTTGCGCCC GCTCAGCGTC
GTTACAGCGA TGGGCCCGCG TATTGTTCCC ATGCCGAACG GGAATATGAC TACCAACCGC
CGCGCGAGCG GAGCAAATTT CGAATATGGC GGTGAGCAGC AGGACATCAA GGCAACCGGA
TACGAGTATG GTCAGGTGAA GCTGTCGGCG AAGAAGCTGA GCGGGATCAT CCCGATATCC
AATGACCTGC TGCGCACGGC CTCCACGGCC GTCGACCGAA TGGTGCGCGA TGATGCACTG
GCCGATGCTG CGCAGATCCA GGATCGTCAT TTCCTCCGCG GTGCGGGAAC AGATTATGCG
CCAAAGGGGC TTCGTTTCCA GCACACGGGC ACGCCTTTCG CCGCGACCCA TGTGCTGACG
ATGACCGCTG CGCCGGATCT GCAAAAGGTG GATAACGATC TCGGCCGCCT CGAGCTCGCT
CTGGCGAACA ACAATGTCGT TGTGACCGGG GCGCATTGGA TCATGTCGCC GCAAATTGCG
ATGTTCCTGA CCAACCTGCG CGACGGCAAT GGCAACAAGG TTTATCCGGA GATGGCCAAT
GGCCAGCTGC GCATGAAACC GGTGCACATC ACCACCGAGA TCCCGAGTAA CCTTGGTGGA
GGCGGCAACG AGTCCGAGAT CATGCTGGCG CATCCGGGTC ACATCCTTGT TGGTGAGCAC
ATGGGCATTG AAGTCGCGAT GTCTACCGAA GCGGCCTACA AGGACTCCGC GGGCAATATG
CAGGCCGCGT TCTCTCGCGA CGAGACACTG ATGCGGATGA TCATGCAGCA TGACATTGGC
CTGCGCCATC TGCCAGCCGT GGCCGTCCTT ACGGGCGTCA CTTGGGCACC CGGCCTCTGA
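
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before any computation such as GC content. The following is a minimal sketch, not the method used to produce the record's GC content field: the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied as displayed (blocks of 10 bases).
first_row = "ATGGATAAGA TCCTGGAACT GCGAGCCCGC CGCGCGGGTA TCATCGACCG TATGGACGCG"

seq = clean_sequence(first_row)
print(len(seq))                     # number of bases in this row
print(round(gc_fraction(seq), 2))   # GC fraction of this row only
```

Passing the full 1320 bp sequence through the same two helpers would give a value to compare with the GC content field; a single row will not necessarily match the whole-gene figure.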
 
Protein sequence
MDKILELRAR RAGIIDRMDA LVASIGDGEE WTEDQTAQFD ALKAEDDKVT AELTRLEDVE 
RRRAEAARPA APLPGAAGTE AGGVPTAPAA PKEPGLQFAR MVRTIAAAGG NQYVAQQIAE
ASGDSGLFAS QNMSTGTAGG FLVPEDVSSE VIELLRPLSV VTAMGPRIVP MPNGNMTTNR
RASGANFEYG GEQQDIKATG YEYGQVKLSA KKLSGIIPIS NDLLRTASTA VDRMVRDDAL
ADAAQIQDRH FLRGAGTDYA PKGLRFQHTG TPFAATHVLT MTAAPDLQKV DNDLGRLELA
LANNNVVVTG AHWIMSPQIA MFLTNLRDGN GNKVYPEMAN GQLRMKPVHI TTEIPSNLGG
GGNESEIMLA HPGHILVGEH MGIEVAMSTE AAYKDSAGNM QAAFSRDETL MRMIMQHDIG
LRHLPAVAVL TGVTWAPGL