Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0041 |
Symbol | |
ID | 4076308 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 43680 |
End bp | 44879 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638005328 |
Product | HI0933-like protein |
Protein accession | YP_612036 |
Protein GI | 99079882 |
COG category | [R] General function prediction only |
COG ID | [COG2081] Predicted flavoproteins |
TIGRFAM ID | [TIGR00275] flavoprotein, HI0933 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.32138 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGACT GGGATGCGGT CGTGATTGGA GGCGGTCCGG CTGGATTGAT GGCAGCAGGG GAGGTCGCGC GTCAGGGCCA TCGGGTGCTC TTGGTGGAAG CCAAGCCGTC GCCTGCGCGC AAGTTTCTGA TGGCCGGAAA ATCCGGTCTA AACCTGACCA AAGACGAACC CTTCGAGGAT TTGCTGGCGC AGTATGGCGA CTCGGCTGAG TGGCTGGCAC CGATGATCAA GGCGTTTGAC GCTGCGGCTG TGCAAGACTG GGCGCGCGGG CTCGGGCAGG AGCTTTTTAC CGGGTCGACG GGACGGGTGT TTCCCACGGT CATGAAGGGC TCTCCCTTGT TGCGGGCGTG GCTTCAGGAT CTGGACACAC ATCGCGTTAC CCGGCAACTC GGCTGGCGCT GGACAGGTTG GCAGCAGGAC GGGCAGCTGT TGTTCGGTAC GGCTGCAGGG CCACAAATCG TGACATCCCG CGCCACCATA CTGGCGCTTG GTGGTGCGAG CTGGGCACGG CTTGGTTCGG ATGGCGCCTG GGCGGCCCTA TTGGCGTCGC GCGGCGTCGC CTTGGCACCG TTTCAACCGT CGAACGCTGC CCTTTCGGTC GCCTGGAGTG ATCACATGAC GCCGCATTTT GGCGCAGCAC TCAAGGCAGT GGCCTGGCAG GCAGGCGCAT TGCAGGCCCG TGGCGAGGCG ACCTTGTCGC AACGCGGGCT CGAAGGCGGT GGGCTTTACA CGTTGACCCC AGCTCTGCGC GAAGGGCAGC CGCTTTTTGT CGACTTGTCG CCGGACCTCA ACGAGGGCGA TCTTGCCCGG CGGCTCGCGA AACCGCGTGG TAAGACGAGC TGGTCGAACC ACATGCGCCG CACGCTCAAG CTTGCGACGG TGAAAATGGC ACTATTGCAA GAGTTTGGCC GCCCGCTGCC GCAGGATCCA GAGAGCCTGG CCCGTCTCAT CAAACATTTG CCCGTGCGCC ATACGGGGTT ACGCCCAATG GACGAGGCAA TCTCGACGGC TGGCGGTGTG CGCCGCGATG CCTTGGATGA CGGCCTCATG CTAAAGGCGA TCCCCGGCAC GTTTTGCGCC GGAGAGATGC TTGATTGGGA TGCGCCAACG GGAGGGTATC TCTTGACTGC CTGCTTTGCG ACCGGCCGTT GGGCAGGGCA AGCGGCGGCG CGCTACTTGG CGAGTTCCGC CACGCGCTGA
|
Protein sequence | MQDWDAVVIG GGPAGLMAAG EVARQGHRVL LVEAKPSPAR KFLMAGKSGL NLTKDEPFED LLAQYGDSAE WLAPMIKAFD AAAVQDWARG LGQELFTGST GRVFPTVMKG SPLLRAWLQD LDTHRVTRQL GWRWTGWQQD GQLLFGTAAG PQIVTSRATI LALGGASWAR LGSDGAWAAL LASRGVALAP FQPSNAALSV AWSDHMTPHF GAALKAVAWQ AGALQARGEA TLSQRGLEGG GLYTLTPALR EGQPLFVDLS PDLNEGDLAR RLAKPRGKTS WSNHMRRTLK LATVKMALLQ EFGRPLPQDP ESLARLIKHL PVRHTGLRPM DEAISTAGGV RRDALDDGLM LKAIPGTFCA GEMLDWDAPT GGYLLTACFA TGRWAGQAAA RYLASSATR
|
| |