Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1142 |
Symbol | |
ID | 4078438 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1227757 |
End bp | 1229316 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638006446 |
Product | hypothetical protein |
Protein accession | YP_613137 |
Protein GI | 99080983 |
COG category | [S] Function unknown |
COG ID | [COG2861] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0823689 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGGCG CGCGTGAGAA AGAGCACAGG CGAGAAGGCG AAACAGGCAT GCGAGGATTT CTGGGTGGCG TGAGCGTAGG GGCGCTGGTC GCAGTTGCGG GGGCGGCGGT GTGGTCGCTG TCGACGCCAT TGCCGCAGGC GGTCGATGTC TCGCAGGAAC TGCCCGTCAC AACATTGCAG CCGCGTGAGG TTCCAGCGCC CCCGTCCGAT GTGCCCGGAA GCGACGCAGA TCTGGTTGAG GCCGCCCCAG CGGAACCCGA TGCAAATGCC CTTGGCCGAG ACAGCGCCCC AGATGAGATC GACACTTCGC TGCCAGAGCG TCCCAGCGTT TCGACCGATC CCGAGGTCAC ATTGGATGGC GCCCCGGTTC AGAGCGACAC CCCCCGGATT GCGGTGACTT CCGATGCGGA TGCGCCCGTG TCTGGTGGCG ATGACAGCAT CCGCCCTGAG GCGCCCTCGC AAGACACCAC ACCGGATCTG GGAGCAGATT CCGCCACACG GCCCGAGGTT TCGGCGGCCT CAGACCTGCC ACTGCAGGCG CCAGACGCGG AAATCCCCAC CCCCGATCTG TCGACCGAGG CCGATCCGGC GCCCCTGCGC CAGGAGACGC TTCAAGTGGA GGCGCCAGAT ATCGGCACGC CTCCGGATGT GACCGCGTCG CCCGTGTTGA CCACTCCGCC CGTGCGGTCG CTCACCCCTT CTGATCCGGT CGAGGACAAC GCGGGGGCAA ATGGGGGCGT GAGAATCGCG GATCTGCCGC AGGCCTCCGA GGACCCAGAG ACAAATGCAG GACCCAGTAT CGGCACGCGC GTGCTGCCAC TGACAGAACG CGACACCACG GGCGCGGATG CGTCTTCTGC AGCAGACAGT AGTGCTGCGC CCTTTGTGCG CAACAGCGAG CCGGCAAACC CGGAGCCGGG CCTGCCCTTG ATGTCGATTG TCCTCGTCGA AGAAGAAGGC GCGGTGGGCG CCGAGGCGCT TGAGGATTTC CCCTATCCGC TGACGTTTAC CATCGACCCG AGCGACCCGG ATGCGGTGGC ACGCATGAAA GCCCGGCGCG CGGCGGGGTT TGAGGTCATG GTGTTGGCGG ATCTGCCCCG CGAAGGCCAG CCGCAGGACG CCGAAACCGC GATGCCGGTG TGGTTTGACC GCCTTCCAGA GGCGGTGGGC ATCCTTGAGG GCATCGACAG CGGCGTGCAG GGCAACCGGG CGCTTGCGGA TCAGGTGGCC AGCATCGCCG GTGATCTGGG CTATGGGCTG GTGCTACAGG ACAATGGCCT GAACACGGTC CACAAAATGG CGCTGCGCGA TGGTATTCCT TCGGGCGTGG TGTTTCGCGA CTTTGACGGC GCGGGTCAGG ACCCGCGCGC CATGCGCCGT TTTCTGGACC AGGCCGCGTT CCGCTCCGGT CAGGAGGGCG CGGTCATCAT GCTGGGACGT CTGAAGCCGG ACACGATTTC CGCGCTGCTG ATCTGGGGGC TGCAAGACCG CGCCAGCAGC GTGGCGCTGG TGCCGATCTC GACCAGTCTC AAACGCCTGC TGGAGCCGGT CTCAAACTAA
|
Protein sequence | MGGAREKEHR REGETGMRGF LGGVSVGALV AVAGAAVWSL STPLPQAVDV SQELPVTTLQ PREVPAPPSD VPGSDADLVE AAPAEPDANA LGRDSAPDEI DTSLPERPSV STDPEVTLDG APVQSDTPRI AVTSDADAPV SGGDDSIRPE APSQDTTPDL GADSATRPEV SAASDLPLQA PDAEIPTPDL STEADPAPLR QETLQVEAPD IGTPPDVTAS PVLTTPPVRS LTPSDPVEDN AGANGGVRIA DLPQASEDPE TNAGPSIGTR VLPLTERDTT GADASSAADS SAAPFVRNSE PANPEPGLPL MSIVLVEEEG AVGAEALEDF PYPLTFTIDP SDPDAVARMK ARRAAGFEVM VLADLPREGQ PQDAETAMPV WFDRLPEAVG ILEGIDSGVQ GNRALADQVA SIAGDLGYGL VLQDNGLNTV HKMALRDGIP SGVVFRDFDG AGQDPRAMRR FLDQAAFRSG QEGAVIMLGR LKPDTISALL IWGLQDRASS VALVPISTSL KRLLEPVSN
|
| |