Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2627 |
Symbol | |
ID | 4077930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2759858 |
End bp | 2761072 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638007951 |
Product | protein of unknown function UPF0052 and CofD |
Protein accession | YP_614621 |
Protein GI | 99082467 |
COG category | [S] Function unknown |
COG ID | [COG0391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01826] conserved hypothetical protein, cofD-related |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.800354 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAAAGA CCGCGCCCTC GCCCCCAGCC CCAGCCTCTG CCCCCGGCTC CGATCCCGGT TCTGATCCCG GCCCTGCCCA CGTCCGGCCC CCCGGACGGG ATGCAACACA AGGCCCGCGC CTGTTGTTCT TTTCCGGTGG CACGGCCCTC AACGAGATCT CGCGCAGGCT CAAGGCCTAC ACGCAGAACT CGGTGCATCT GATTACACCC TTCGACAGCG GTGGCTCATC GCAGGTGCTG CGCAAAGCCT TTGGCATGCC CGCGGTCGGC GACCTGCGCA GTCGACTTAT GGCGCTGGCG GATGAAACCG ATAGAGGCCA GCCAGAGATC CTGCGCCTGT TCACCCATCG CTTTGCAAAA CACGCGTCAG AGCGGAATGT GACACAGGAC GCTGCCCGGC TCTTTGAGGG CACGCATCCG CTTTTGCAGG GGATTCCAAC GCCGGTACGA CAGCAAATCC GCGAAGACCT GCGCCAGTTT CAGGACGCCG CCCCGGCGGA TCTGGACTAT CGCAACGCCA GCATCGGCAA CCTGATCCTC GCGGGGGGCT ATCTGCGTCA CGGCCGCCAG CTTGAGCCGG TGCTTGCCCA GATGTCGCGG ATGGTGGCGG TGCGCGGCAC CGTGCGCCCG ATTGCGGATG TGAACCTGGA GATCGGTGCA GAGCTTCGGG ACGGGCGGCG CGTCATCGGT CAGCGCCGGA TGACGGGCAA GGAGCACGCA CCGCTCACCA GCCCTATCGC GCGCCTCTTT CTGTCAGATG GCACCCGCGA ACTGCCTGCG GATGCGGTGC CCCTCCCGCA AAGCAACCAA GACCTCATCG CCGGGGCGGA CCTGATCTGC TACCCGCCCG GCAGTCTCTA TTCGAGCGTG ATCTGCAATC TCCTGCCCAA AGGTGTGGGC CAGGCCATTG CCGCGCGCAA CGTGCCAAAG GTCTATGTCC CAAGCCTCGG CACAGATCCG GAATGTCTGG CGATGACGCT CTCGGATCAG ATCTCTGCCT TACTGGCACC GCTGCGCCGG GACGCTGGTG ATGTGGCCAC CTCAGCCTTT CTCAGCCATG TGATCTGTGA CCTCAGCGTG TCAGAGGCGG CACGCGCAGA GGTCCTGCGC GATCACGGCA TCCCCTGCAT CGCGCGGCCT TTGGCGGTCT CTTCGGGCAA GTCGCCCTGC TATGCGCCGG ATGCCCTCTG CAGACAGCTA CTGGCACTGG CCTGA
|
Protein sequence | MSKTAPSPPA PASAPGSDPG SDPGPAHVRP PGRDATQGPR LLFFSGGTAL NEISRRLKAY TQNSVHLITP FDSGGSSQVL RKAFGMPAVG DLRSRLMALA DETDRGQPEI LRLFTHRFAK HASERNVTQD AARLFEGTHP LLQGIPTPVR QQIREDLRQF QDAAPADLDY RNASIGNLIL AGGYLRHGRQ LEPVLAQMSR MVAVRGTVRP IADVNLEIGA ELRDGRRVIG QRRMTGKEHA PLTSPIARLF LSDGTRELPA DAVPLPQSNQ DLIAGADLIC YPPGSLYSSV ICNLLPKGVG QAIAARNVPK VYVPSLGTDP ECLAMTLSDQ ISALLAPLRR DAGDVATSAF LSHVICDLSV SEAARAEVLR DHGIPCIARP LAVSSGKSPC YAPDALCRQL LALA
|
| |