Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3492 |
Symbol | |
ID | 4075132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 524505 |
End bp | 526412 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638005007 |
Product | Beta-galactosidase |
Protein accession | YP_611726 |
Protein GI | 99078468 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGAA CCCTTGGCAC CTGCTATTAC CCCGAGCACT GGCCAGAAGA GATCTGGGCC GAAGACGCCG CGCGTATGAA AGCTGCGGGC CTGACGTGGA TCCGGATTGG GGAATTTTCG TGGTCGAAAC TCGAACCCAC TCCCGGCGAT CTGCACTGGG ACTGGCTCGA CCGCGCGATC GAGACATTGG GGGCGCAAGG CTTGCGGGTG GTGCTCGGCA CCCCCACAGC GACCCCGCCG CGCTGGATGG CCGAGCGTCA CCCGGACATG TTTGCTGTCA CCGCCGAGGG CCAGCCACGC GGCTTTGGCT CCCGGCGGCA CTATTGCTTT AGCCACAAGG GTTATTTTGC CGAGAGCCAG CGCATCACCC GCCTGATGGC AGAGCGCTAT GGTGCCAACC CGCATGTGGC CGCGTGGCAG ACCGACAATG AATATGGCTG CCATGATACC GTGATCAGCT ATTCCGACGC GGCTCAAACC GCTTTTCGGG CGTGGCTGGC AGAGCAGTTC GACGGCGAGA TCGGCGCGCT CAACGCGGCC TGGGGCAATA TGTTCTGGTC CATGGAGTAT CGCAGCTTTG ACGAAATCGG CCTGCCCAAC CTCACCGTGA CGGAGCCGAA CCCGGCGCAT GTGCTGGCGT TCAGACGCTT CAGCTCCGAT CAGGTGGTGG CTTTCAACCG CGCGCAGGTC GAGATCATCA AGGCCCATTC AACCGCGCCG ATTTCTCATA ACTACATGGG GCGGATCACC GATTTCGACC ACTTCAAACT AGGTGAGGAT CTCGAGATCG CGACCTGGGA CAGCTACCCG CTGGGCTTTC TGGAAGACCG CGTGGGGGCC TCACCCGAGG AACAGCGCGC TTATGCCCGG CAGGGGGATC CGGATTTTCA GGCCCTTCAT CACGATCTCT ATCGCGCGGT TGGGCGCGGG CGCTGGTGGG TCATGGAACA GCAGCCGGGG CCAGTGAACT GGGCGCCCTA CAACCCGGCA CCCCTGCCGG GCATGGTGCG GCTCTGGACC TGGGAGGCCT TTGCCCATGG CGCCGAGGCT GTGTGTTATT TCCGCTGGCG GCAGGCGCCT TTTGCGCAGG AACAGATGCA CGCAGGCCTC TTGCGTCCCG ACAGCCAGGA CGCCCCCGCC ATGCAAGAAG CGATGGATGT TGCGGCAGAG CTTGGCGCGG CAGCCGATGT GCAGCCCGCG CAGGCACCGG TGGCGATCCT TTTTGATTAC GATGCCGATT GGGCGTGGTC GACGCAGCCG CATGGTGCAG GGCTGAGCTA TTTCCAGCTG ATCCTCGAAC ACTACAAGGC GCTTCGGCGC GCTGGTCAGA CCATCGACAT CCTGCCCCCG GAGACCCGCG ATTTCACGGG GTACAAGATG ATCCTTGCGC CCGGGATGAT GCATCTGCCA GAGCCCCTCA AAGAAGCGCT CGCAAGAAGT GAGGCCGAGG TGCTCTATGG TCCGCGCAGC GGTGCGCGTG ACGGTCATTT CTCCATCCCG ACCAGCCCAC TGCCACCTGC ATTGCCGGGG CTGGACGTGA CCGTGGCACG GGTGGAGAGT CTGCGCCCGG ATATGCCCAT CGCCCTTAAG GGCGGTGGTG CAGTGCGCGG CTATCTTGAG GAGCTTGAAG GCACTGCAGA AGTGGTCTTT GAAACCTCCG AGGGCGCGGC GGTCGCGCTC CGGGCCGGGC GACAGACTTA TTGTGGCGGC TGGCTCGATG CAGAGGGGCT TGATCGGTTG ATTGCCACCG TTGCGCAGGC GGCGGGTCTG GAGTTGCGCC AGATGCCGGA AGGGGTGCGC ACCCGCCGCA CGGCAACCGA GGTCTTCTGG TTCAACCACA GCGCAGAGCC TGTCGAAACC GAAGTTGGCC TCTTGCCTCC GGCGGGGGTG AAACGGATCG CGCTTTAG
|
Protein sequence | MKRTLGTCYY PEHWPEEIWA EDAARMKAAG LTWIRIGEFS WSKLEPTPGD LHWDWLDRAI ETLGAQGLRV VLGTPTATPP RWMAERHPDM FAVTAEGQPR GFGSRRHYCF SHKGYFAESQ RITRLMAERY GANPHVAAWQ TDNEYGCHDT VISYSDAAQT AFRAWLAEQF DGEIGALNAA WGNMFWSMEY RSFDEIGLPN LTVTEPNPAH VLAFRRFSSD QVVAFNRAQV EIIKAHSTAP ISHNYMGRIT DFDHFKLGED LEIATWDSYP LGFLEDRVGA SPEEQRAYAR QGDPDFQALH HDLYRAVGRG RWWVMEQQPG PVNWAPYNPA PLPGMVRLWT WEAFAHGAEA VCYFRWRQAP FAQEQMHAGL LRPDSQDAPA MQEAMDVAAE LGAAADVQPA QAPVAILFDY DADWAWSTQP HGAGLSYFQL ILEHYKALRR AGQTIDILPP ETRDFTGYKM ILAPGMMHLP EPLKEALARS EAEVLYGPRS GARDGHFSIP TSPLPPALPG LDVTVARVES LRPDMPIALK GGGAVRGYLE ELEGTAEVVF ETSEGAAVAL RAGRQTYCGG WLDAEGLDRL IATVAQAAGL ELRQMPEGVR TRRTATEVFW FNHSAEPVET EVGLLPPAGV KRIAL
|
| |