Gene TM1040_0090 details


Gene Information

Locus tag: TM1040_0090
Symbol:
ID: 4078756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 92456
End bp: 94201
Gene Length: 1746 bp
Protein Length: 581 aa
Translation table: 11
GC content: 63%
IMG OID: 638005377
Product: heparinase II/III-like
Protein accession: YP_612085
Protein GI: 99079931
COG category: [S] Function unknown
COG ID: [COG5360] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
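
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the CDS includes its stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(92456, 94201)
print(length, protein_length_aa(length))  # prints: 1746 581
```

Both values match the Gene Length and Protein Length fields reported for this record.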


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.379483
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAAAT ATGACCGAAT GGCCAGTCGC GGCACGCGAC TGCTGAACCG CTATACCGCC 
TGGAAGGCGC GCAAACAGCC CGCGGCCACT GGGTTTGTCT CGCAGCCCGA GCCGCGTACC
ATAGGCAGTT TTGCGCGTGG GCGGCAGCTG GTGGCCGGCA ACCTCCTGTT TGCAGGCTAT
CTGGTAGAGA GTGACACCAC TGGCCTTTGG GATGTGGAGG CGCCTGATTT TGCCTTTGAG
GCAGAGCGCC AGGGCTGTAC ATGGCTCGAT GATCTGGCCG CGGTCGGCGA TCTGAAGGCG
CGCAGCAAGG CCCAGCACTG GGTCCGGGGC TGGATCGATG AGTTTGGCAA GGGCACTGGT
CCCGGCTGGT CGCCGGATCT GACCGGGCGG CGCGTGATTC GATGGATCAA CCACGCCCTA
TTCCTGCTGA GTGGTCAGGA CAAACCAGCC TCCGATGCCT TTTATCGGTC TCTCTCGCAG
CAGACGTGGT TCCTGTCGCA GCGCTGGAAG GGGGCATCGC CTGGCTTGCC GCGGTTTGAG
GCGCTTACCG GGCTGATCTA TGCGGGGCTT GCGCTCAAGG GCTGCGAGGA ACTCGCAGAC
CCTGCGGTCA AAGCGCTGGC TCAGGACTGC GCGCAGCAGA TCAACGCCGA GGGCGGCCTG
CCGACCCGCA ACCCCGAAGA GCTACTGGAT GTGTTCACCT TGCTCACATG GGCCGCAGCG
GCGTTGCATG AAGCGGGACG GTCGGTGCCG CGCGAGCATC ACGCCGCGCT GGATCGTATC
GCACCCACAC TGCGGGCCTT GCGCCACAGC GACGGGGCGC TGGCGCGGTT TCATGGCGGC
GGGCGCGGGC AGGAGGGCTG GCTCGATCAC GCGCTGGCCG CCTCGCATGT CCCTGCAAGA
CCCTTTGAAG GGCTGGCGAT GGGGTTTGCA CGACTCTCGG CGCGGCGCAC TTCGCTCATC
ATTGATGCCA CCGTGCCACC CGTTGGCAAG GCCTCCTACA ATGCACACGC TTCGACTTTG
GCTTTCGAGC TAACGTCCGG GCGGCGTCCC TTGATCGTGA ATTGCGGCGC GGGCGAGAAT
TTCGGGCTAG AGTGGCGTCG GGCGGGACGG GCCACGCCCT CTCATTCTGC GCTCTGTATC
GAAGGGCACT CAAGCGCCCG GCTGGCCGCG CCGCAAAAGG GCACGGGACA TGAGTTCCTG
ATCGACGCGC CCACCGATGT GCCAATCGAG CGCGAAGACC TTGTGGATGG GTATCGGTTT
CAGGGTGCTC ATGATGGCTA TGCCAAATCC TATGGCGTGA CCATTGCGCG CTCTTTGGAA
CTGTCGGTGG ACGGGCGCAT GGTGTCGGGC GAGGACATGG TGCTGGCACT TGATGACGCC
GCAAAAAAGT GCTTCGACAG GGCGCTGGAC GCGGGCGGCC TGCGCGGTAT TGGCTATGAT
TTACGGTTCC ATTTGCACCC GGATGTGGAC GCAGCCCTTG ACTTAGGGGG CGCAGCAGTA
TCCATGGCGC TCAAGAGTGG GGAAATCTGG GTATTCCGTC ACGATGGTCA ATGCGACCTC
AAGCTGGAAA CCAGCGTTTA CCTGGAAAAG GCCCGCTTGA AGCCGCGTCA ATCGCTGCAA
ATCGTCCTGT CGGGCCGGGC CATTCAATAT GCGACCCAGA TCCGCTGGAC CCTCAGCAAG
GCGCAGGAAA CGGCTGTGGC CGTGCGCGAC TTGGCCCGCG ACGACCCCAT GGCCTACGAA
GAGTGA
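
The gene sequence is displayed in space-separated blocks of ten bases, so it needs to be joined before any computation. The following is a minimal sketch of how the blocks might be cleaned up and how a GC percentage could be computed for comparison with the reported GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not state exactly how its own figure was calculated or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence: six blocks of ten bases.
first_line = "ATGTCCAAAT ATGACCGAAT GGCCAGTCGC GGCACGCGAC TGCTGAACCG CTATACCGCC"
print(len(clean_sequence(first_line)))  # prints: 60
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field above.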
 
Protein sequence
MSKYDRMASR GTRLLNRYTA WKARKQPAAT GFVSQPEPRT IGSFARGRQL VAGNLLFAGY 
LVESDTTGLW DVEAPDFAFE AERQGCTWLD DLAAVGDLKA RSKAQHWVRG WIDEFGKGTG
PGWSPDLTGR RVIRWINHAL FLLSGQDKPA SDAFYRSLSQ QTWFLSQRWK GASPGLPRFE
ALTGLIYAGL ALKGCEELAD PAVKALAQDC AQQINAEGGL PTRNPEELLD VFTLLTWAAA
ALHEAGRSVP REHHAALDRI APTLRALRHS DGALARFHGG GRGQEGWLDH ALAASHVPAR
PFEGLAMGFA RLSARRTSLI IDATVPPVGK ASYNAHASTL AFELTSGRRP LIVNCGAGEN
FGLEWRRAGR ATPSHSALCI EGHSSARLAA PQKGTGHEFL IDAPTDVPIE REDLVDGYRF
QGAHDGYAKS YGVTIARSLE LSVDGRMVSG EDMVLALDDA AKKCFDRALD AGGLRGIGYD
LRFHLHPDVD AALDLGGAAV SMALKSGEIW VFRHDGQCDL KLETSVYLEK ARLKPRQSLQ
IVLSGRAIQY ATQIRWTLSK AQETAVAVRD LARDDPMAYE E
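
The protein sequence corresponds to a translation of the gene sequence with translation table 11. As a rough sketch, the codon table can be built from the conventional NCBI ordering of bases (T, C, A, G) and amino acids. For codons inside the reading frame, table 11 assigns the same amino acids as the standard code; it differs in which codons may act as start codons, which this sketch does not handle. That simplification does not matter here because the gene begins with ATG. The names below are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGTCCAAAT ATGACCGAAT GGCCAGTCGC"))  # prints: MSKYDRMASR
```

The same function applied to the last six bases of the gene, GAGTGA, yields a single E followed by the stop codon, in agreement with the final residue of the protein sequence.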