Gene TM1040_1459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1459 
Symbol 
ID4077756 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1557176 
End bp1560106 
Gene Length2931 bp 
Protein Length976 aa 
Translation table11 
GC content61% 
IMG OID638006770 
ProductRNAse E 
Protein accessionYP_613454 
Protein GI99081300 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1530] Ribonucleases G and E 
TIGRFAM ID[TIGR00757] ribonuclease, Rne/Rng family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value2.80055e-06 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.379566 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAGA AAATGCTGAT CGACGCCACC CACGCAGAGG AAACTCGCGT TGTGGTGGTG 
GATGGAAACA AAGTTGAGGA ATTCGACTTT GAATCTGAAA ACAAACGCCA GCTGGCTGGC
AATATCTATC TTGCAAAAGT AACCCGCGTC GAGCCTTCGC TGCAGGCCGC TTTTGTGGAC
TATGGCGGCA ACCGCCATGG CTTCCTCGCG TTTTCGGAAA TTCACCCGGA CTATTACCAG
ATCCCCGTCG CGGACCGCGA GGCCCTGATG GAAGAAGAGC GCGCCTATGC CGAGGCAATG
CGCGCCCGCG ATGAAGAAGA CGACGAACCC AAGCCCAAGC GCCGCAGCCG CTCGCGCAGC
AAGACCCGCG CTGAAAAAGC CAAGACCGCT GACGCCGTTG AAACAAAAGA AGTCGCCACC
CCCGAGGGTG AGATTTCCGG AATGGAGACA ATTGACCTCA GCGATGACAG CACGGTGCTG
GCAGAGGTGC CGGAGGCGAC CTCGCCGATG GAAACCGTAG CAGAGACCCC GGTCGAGGAA
CCGGTGGGCG ATGATGCAGC GGCAGAGATG CCTGAGGCGG ACACAACTGA CGGCGAAGCG
CCAGCGGGCG ACGACAGCGT CGTAGCAGAA CCCGCGACAG ATGGCGAAGC CGAAACCGTG
GCCGAAGACT CCGCCGCTAT TGAAGAGGCG GATGCGGACG ACGACACCGA TGAAGGTGGC
ACAGAGGATG CGCCCGAGGC GAAAACCGGT GGGGCTGATG CAGCCGACAA GGATGACAGC
ATCGAGTCGG TTGCCGACGA CGATGATCAG GAAGACATTC GTCCGCCGCG CAAACCGCGC
CCCAAACGGT ACAAGATCCA AGAGGTCATC AAGGTCCGTC AGGTGCTGTT GGTGCAGGTT
GTCAAGGAAG AGCGTGGCAA CAAGGGCGCT GCGCTGACCA CTTACCTGTC TCTTGCAGGT
CGCTATTGCG TTCTGATGCC CAACACCGCC CGTGGCGGCG GCATCTCGCG CAAGATCACC
AATGCCGCAG ACCGCAAGAA ACTCAAGGAC ATCGCGACCG AACTTGATGT GCCGACCGGG
GCGGGACTCA TCGTGCGCAC CGCCGGGGCC AAACGCACCA AGGCGGAGAT CAAGCGCGAC
TATGAATACC TTCAGCGCAT GTGGGAGCAG ATCCGCGAAC TGACGCTGAA ATCCATCGCG
CCCGCAAAGA TCTATGAAGA GGGCGACCTC ATCAAACGCT CGATCCGCGA CCTGTATAAT
CGCGAGATCG ACGAGGTTCT GGTTGAAGGC GAACGCGGCT ACCGCATCGC CAAGGACTTC
ATGAAGATGA TCATGCCGTC CCACGCCAAG AACGTGAAAA ACTACCAGGA TCAGCTGCCG
CTCTTTGCGC GTTATCAGGT GGAAAGCTAT CTCGGCGGGA TGTTCAATCC GACCGTTCAG
CTGAAGTCGG GCGGCTATAT CGTGATCGGC GTGACCGAGG CACTGGTGGC AATCGACGTG
AACTCCGGCC GGGCAACTAA GGAAGCATCG ATCGAGGAAA CCGCGGTCAA GACCAACCTG
GAGGCGGCCG AAGAAGTGGC GCGCCAGCTG CGTCTGCGCG ATCTCGCGGG TCTCATTGTG
ATCGACTTCA TCGACATGGA CGAGCGCAAG AACAATGCCG CCGTCGAGAA GAAGCTCAAA
GACAAGCTCA AGACCGATCG TGCCCGTATT CAGGTGGGCC GGATCTCCGG CTTTGGCCTC
TTGGAAATGT CGCGCCAGCG TCTGCGTCCG GGCATGATCG AGGCAACAAC AGCGCCTTGT
CCGCATTGTC ACGGCACCGG ACTTATCCGG TCGGACGACA GCATGGCGCT GTCCATCCTG
CGTCAGATCG AAGAGGAAGG CACCCGCCGC CGGTCGCGCG AAGTGCTGGT AAAATGTCCC
GTGGACATCG CGAACTACCT GATGAACCAG AAGCGCGAGC ATATCGCTCA GATCGAAGCG
CGCTATGGCC TGTCGGTCCG GATCGAAGGC GATGTGACCC TCGTCAGCCC GGATTTCTCG
CTTGAGAAGT TCAAGACGGC ATCGCGCGCG ATCCCTGCAG TCACGACCCC TGTGGTGTCG
GTGGATGCAT CGATCATGGA TCAGGTCGAT GCCATTGAAG AGGTTGCTGA AACCCCGGAG
GAAGCGCCTG CGCCCCAAGC CCCTGAAGCC GATGAAACCG AAGGCGAGGC AAAGCCCAAG
CGCAAGCGTC GTCGCCGGCG TCGCAAGAAG TCTGGCAATG GCGAGAACGG TCAGGACGCT
GACGCGCAGG CAAGTGAGGC AAATGCCTCC GATGCCTCCG ATGCGGCGAC AGACTCTCAG
GATGCTGAGT CAGATTCCAA AACCGAAACC GCGCCGGATG CCGCTGGCGA AGGCGACGGA
GAGGCGGAGC CCAAGAAGAA GCGCACTCGG ACGCGCACCC GCAGCCGCTC TCGCAAGAAG
GTGGAAACCG AGGTAGCTGC AGAGGCAGAG GCTCAGGACG CCGCTCCGGA AGCGACGCCT
TCGGAAGCCT CTGAGGCAGA AGAGGCAGCC CCAGCCGAAC CGGCAGCACC TGCTCCGGTT
GAGGATGCGG CTGTTGATGA GGCGCCTGTT GTCGCTGCAG AGACCACTGC ATCCGAAGAG
GCAACTGCGG AGGAAGCACC CTCCGAGGCA CCGCAGAACA CTGCAGACTC GCAAGAGGCG
CCGAAGGCCG AGACTGTTGT TGCCGAGACG ACACCTGCTC CGGTCGCAGC AGAAGAGGCC
GCATCTGAAG AAGTTGCTCC TGAAGAGACC CCGGAGCCGG TTCAGGAGGA GACCCACGCA
GAGGTTGAAC CCGAAGAGGC GAAGCCGGAA CCGGCGCTTG CGGTTGCGGA ACCTGAACCC
GAGGAACCGG CCAAGCCCAA ACGGCGCGGC TGGTGGTCCG TGGGTCGTTA A
 
Protein sequence
MAKKMLIDAT HAEETRVVVV DGNKVEEFDF ESENKRQLAG NIYLAKVTRV EPSLQAAFVD 
YGGNRHGFLA FSEIHPDYYQ IPVADREALM EEERAYAEAM RARDEEDDEP KPKRRSRSRS
KTRAEKAKTA DAVETKEVAT PEGEISGMET IDLSDDSTVL AEVPEATSPM ETVAETPVEE
PVGDDAAAEM PEADTTDGEA PAGDDSVVAE PATDGEAETV AEDSAAIEEA DADDDTDEGG
TEDAPEAKTG GADAADKDDS IESVADDDDQ EDIRPPRKPR PKRYKIQEVI KVRQVLLVQV
VKEERGNKGA ALTTYLSLAG RYCVLMPNTA RGGGISRKIT NAADRKKLKD IATELDVPTG
AGLIVRTAGA KRTKAEIKRD YEYLQRMWEQ IRELTLKSIA PAKIYEEGDL IKRSIRDLYN
REIDEVLVEG ERGYRIAKDF MKMIMPSHAK NVKNYQDQLP LFARYQVESY LGGMFNPTVQ
LKSGGYIVIG VTEALVAIDV NSGRATKEAS IEETAVKTNL EAAEEVARQL RLRDLAGLIV
IDFIDMDERK NNAAVEKKLK DKLKTDRARI QVGRISGFGL LEMSRQRLRP GMIEATTAPC
PHCHGTGLIR SDDSMALSIL RQIEEEGTRR RSREVLVKCP VDIANYLMNQ KREHIAQIEA
RYGLSVRIEG DVTLVSPDFS LEKFKTASRA IPAVTTPVVS VDASIMDQVD AIEEVAETPE
EAPAPQAPEA DETEGEAKPK RKRRRRRRKK SGNGENGQDA DAQASEANAS DASDAATDSQ
DAESDSKTET APDAAGEGDG EAEPKKKRTR TRTRSRSRKK VETEVAAEAE AQDAAPEATP
SEASEAEEAA PAEPAAPAPV EDAAVDEAPV VAAETTASEE ATAEEAPSEA PQNTADSQEA
PKAETVVAET TPAPVAAEEA ASEEVAPEET PEPVQEETHA EVEPEEAKPE PALAVAEPEP
EEPAKPKRRG WWSVGR