Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1145 |
Symbol | |
ID | 4078441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1232945 |
End bp | 1234120 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638006449 |
Product | hypothetical protein |
Protein accession | YP_613140 |
Protein GI | 99080986 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0436] Aspartate/tyrosine/aromatic aminotransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0013509 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0789492 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCTTT CCCGCACCAA CGCCACTTTT GTTTCGCCAA TCGTGCAATC CCGGCGCTGG CTGGAAGATG TGACCTTTTC CGACGATCTG CCGCTCCTCA ATCTCAGTCA GGCCGCCCCG GCCGATCCAC CACCAGAAGG GCTGCGTCAG GCGATGGCCG ACGCCCTGTT GTCAGAGACC AGCGCCCATC TCTACGGACC GGTGCTTGGC AATGCGGATC TGCGCAAAGC TCTCGCCGCC CAAATCACGC GCCACTACGA GGCGGACATC ACACCGTCCG AGGTCGCGAT CACCTCGGGC TGCAATCAGG CCTTTGCCGC CACCATCCAG AGCCTCTGCG CCGAAGGTGA CGAGGTCATC CTGCCGACGC CATGGTACTT TAACCACAAG ATGTGGCTCG ACATGCAGGG GGTCACCACC CTGCCCTTGC CTGTGGGCGA CGGAATGTTG CCAGAAGTCG CCAAGGCCGA AGCGCTGATC ACGCCGCGCA CTCGCGCAAT CGTCCTGGTC ACACCCAACA ATCCCTGCGG CGTTGAATAT CCCGCGGCCT TGATGGACGC GTTCTTTGAG CTGGCACAGC GTCATGGGCT CACGCTGATT GTGGACGAGA CCTATCGCGA TTTCGACAGC CGCGATGGCG CGCCCCACGG GCTGTTTTCG CGCAAGGACT GGCATGAAAC CCTGATACAT CTCTATTCCT TCTCCAAAGC CTATCGGCTG ACCGGCCATC GCGTGGGCGC GATCGTGGCT GCGCCCGAGC GCCTGCTCGA GATGGAGAAA TTCCTCGATA CCGTCACCAT CTGTCCCGCG CAGATCGGCC AGATCGGCGC CAAATGGGGG ATCGAGAACC TCGATGACTG GCTCGCTGGC GAGCGCGGCG AGATCCTCGC GCGGCGCGAC GCCATTGCCG CCGGATTTGC TCCGCTGGCG GCAAAAGGCT GGCAGCTTTT GGGACTGGGG GCCTATTTCG CCTATGTGGC GCATCCGTTC GCGGCAAGCT CCGACGAGAT AGCACAACGC CTGCTTCACT CGGCTGGCAT GCTGCTTTTG CCGGGCACGA TGTTCACACC GGCTGGCGCC CCCGAGGGCC ACCGCCAGTT CCGCATTGCC TTTGCCAATG TCGATCAGAC CGGCATTGCG GAGATGCTCG CGCGACTGGC GCAGTTTGAC GGTTGA
|
Protein sequence | MHLSRTNATF VSPIVQSRRW LEDVTFSDDL PLLNLSQAAP ADPPPEGLRQ AMADALLSET SAHLYGPVLG NADLRKALAA QITRHYEADI TPSEVAITSG CNQAFAATIQ SLCAEGDEVI LPTPWYFNHK MWLDMQGVTT LPLPVGDGML PEVAKAEALI TPRTRAIVLV TPNNPCGVEY PAALMDAFFE LAQRHGLTLI VDETYRDFDS RDGAPHGLFS RKDWHETLIH LYSFSKAYRL TGHRVGAIVA APERLLEMEK FLDTVTICPA QIGQIGAKWG IENLDDWLAG ERGEILARRD AIAAGFAPLA AKGWQLLGLG AYFAYVAHPF AASSDEIAQR LLHSAGMLLL PGTMFTPAGA PEGHRQFRIA FANVDQTGIA EMLARLAQFD G
|
| |