Gene Information |
Locus tag | TM1040_1223 |
Symbol | |
ID | 4075931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1313953 |
End bp | 1315497 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638006531 |
Product | protein of unknown function DUF853, NPT hydrolase putative |
Protein accession | YP_613218 |
Protein GI | 99081064 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
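The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table; names are illustrative.
start_bp = 1313953
end_bp = 1315497

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with exactly one stop codon, which does not
# contribute a residue to the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the script prints `1545 0 514`, which agrees with the Gene Length (1545 bp) and Protein Length (514 aa) fields.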
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00822087 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGAGG AAATATTCGT CGGTGGCGGA GGGGACGCAT ATGGCGATCC ACAATACCTC ACGTTGAAAT ATGCCAACCG GCACGGTCTG ATTGCGGGCG CAACGGGAAC CGGTAAAACC GTTACACTTC AAATCCTTGC CGAGAGTTTC TCGAATGAAG GCGTGCCGGT CATCTTGGCG GATGTCAAAG GAGACCTGTC CGGACTGGCG CGCGCAGGCA GCGAAACGGC GGAGTTGCAT GCGCCTTTCG TGAAAAGAGC ACAAAAAATT GGATTTACCT CTTTTTCCTA TCACGACACC CCCGTCACCT TCTGGGATCT CTTTGCACAA CAGGGCCATC CGATCCGGAC CACGGTTGCC GAAATGGGGC CTTTGCTGCT GTCGCGTCTG CTGGAACTCA GCGAAGCACA GGAGGGTATC CTGAACATTG CCTTCCGGCT TGCGGATGAG CAGGGGCTGC CGCTGCTGGA TCTCAAGGAT CTACAAGCGC TGTTGGTCTG GGTCGGCGAG AACCGCGAAA GCCTGTCCCT GCGCTACGGC AATGTCTCCA CCGCTTCGAT CGGCGCCATC CAGCGCCGCC TGCTGGTTCT GGAGAACCAG GGCGGGGCGC TGATCTTTGG CGAGCCGGCC TTGGATCTTG AGGATCTGAT GCGCTTTGAT GCTGCGGGCC GGGGCATGGT GAACATTCTG GCCGCGGATA AATTGATGGC TTCTCCAAAG CTTTACGCGA CGTTCTTGCT GTGGCTGTTG AGCGAGCTGT TCGAGAGCCT TCCTGAAGTC GGAGATCCGG AAAAGCCCAA GCTGGTGTTC TTTTTCGACG AGGCGCATCT CCTGTTTGAA GACGCACCCA AAGCCCTCAT CGACAAGGTG GAACAGGTCG CACGGCTGAT CCGCTCCAAG GGGGTTGGGA TCTATTTTGT CACCCAGTCT CCGGACGACA TTCCTGAGGA TATTCTGGGG CAGTTGGGCA ATCGCATCCA ACATGCGCTG CGTGCGTTTA CGGCGCGGGA TCAGAAGAAG CTGAAGCTTG CCGCAGAAAC CTATCGTGCA AACCCGCGTT TTTCGACCGA AGACGCCATC CGTGAGGTCG GCGTCGGCGA GGCGGTGACC TCCATGCTCG AGAAAAAGGC CGTACCTGGC GTGGTGGAGC GGACGCTTAT TCGCCCGCCC TCGAGCCAGC TTGGACCGAT CACCGAAGAG TTCCGCAGGA GTGTAATGCA AGCGTCTGAT ATGGCGGGAA AATATGACAA GTCTGTTGAT CGCCATTCAG CCTATGAAAT CCTGAAAGAG CGGGCGGACA AAGCCTCGAG AGAAGCGGCA GACGCCGAGG CGCAAGCCGA AACAGCGCCA GATCCGGTGG TGCGCGAGTT CAGCGCCGCG CGACGGTATA GCGGCAGTCG CGTGGGGCGA TCCACCTCGC GCCGGATCGG CGGCGGTGAC ACTTTTGCCT CCGCCATGTC CGAGTCGGTG ATCAAAGAAC TGAAAGGCAC CACCGGGCGG CGCATCGTTC GCGGGATTCT GGGCGGGCTC TTCAAGGGGC GCTGA
Protein sequence | MAEEIFVGGG GDAYGDPQYL TLKYANRHGL IAGATGTGKT VTLQILAESF SNEGVPVILA DVKGDLSGLA RAGSETAELH APFVKRAQKI GFTSFSYHDT PVTFWDLFAQ QGHPIRTTVA EMGPLLLSRL LELSEAQEGI LNIAFRLADE QGLPLLDLKD LQALLVWVGE NRESLSLRYG NVSTASIGAI QRRLLVLENQ GGALIFGEPA LDLEDLMRFD AAGRGMVNIL AADKLMASPK LYATFLLWLL SELFESLPEV GDPEKPKLVF FFDEAHLLFE DAPKALIDKV EQVARLIRSK GVGIYFVTQS PDDIPEDILG QLGNRIQHAL RAFTARDQKK LKLAAETYRA NPRFSTEDAI REVGVGEAVT SMLEKKAVPG VVERTLIRPP SSQLGPITEE FRRSVMQASD MAGKYDKSVD RHSAYEILKE RADKASREAA DAEAQAETAP DPVVREFSAA RRYSGSRVGR STSRRIGGGD TFASAMSESV IKELKGTTGR RIVRGILGGL FKGR
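The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the pipeline that produced this record. It assumes the sense-codon assignments of NCBI translation table 11, which match the standard code, and it ignores the alternative start codons that table 11 permits, because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# NCBI-style codon ordering: bases in the order T, C, A, G, third base varying fastest.
BASES = "TCAG"
# Amino acids for the 64 codons in that order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    # The record groups bases in blocks of ten separated by spaces; remove them.
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk complete codons only; trailing bases that do not fill a codon are ignored.
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence above.
print(translate("ATGGCAGAGG AAATATTCGT"))
```

The example prints `MAEEIF`, the first six residues of the protein sequence. Passing the full gene sequence to `translate` would be expected to reproduce the complete 514-residue protein, provided the record's two sequences are consistent.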