Gene TM1040_1145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1145 
Symbol 
ID4078441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1232945 
End bp1234120 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content63% 
IMG OID638006449 
Producthypothetical protein 
Protein accessionYP_613140 
Protein GI99080986 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0013509 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0789492 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCTTT CCCGCACCAA CGCCACTTTT GTTTCGCCAA TCGTGCAATC CCGGCGCTGG 
CTGGAAGATG TGACCTTTTC CGACGATCTG CCGCTCCTCA ATCTCAGTCA GGCCGCCCCG
GCCGATCCAC CACCAGAAGG GCTGCGTCAG GCGATGGCCG ACGCCCTGTT GTCAGAGACC
AGCGCCCATC TCTACGGACC GGTGCTTGGC AATGCGGATC TGCGCAAAGC TCTCGCCGCC
CAAATCACGC GCCACTACGA GGCGGACATC ACACCGTCCG AGGTCGCGAT CACCTCGGGC
TGCAATCAGG CCTTTGCCGC CACCATCCAG AGCCTCTGCG CCGAAGGTGA CGAGGTCATC
CTGCCGACGC CATGGTACTT TAACCACAAG ATGTGGCTCG ACATGCAGGG GGTCACCACC
CTGCCCTTGC CTGTGGGCGA CGGAATGTTG CCAGAAGTCG CCAAGGCCGA AGCGCTGATC
ACGCCGCGCA CTCGCGCAAT CGTCCTGGTC ACACCCAACA ATCCCTGCGG CGTTGAATAT
CCCGCGGCCT TGATGGACGC GTTCTTTGAG CTGGCACAGC GTCATGGGCT CACGCTGATT
GTGGACGAGA CCTATCGCGA TTTCGACAGC CGCGATGGCG CGCCCCACGG GCTGTTTTCG
CGCAAGGACT GGCATGAAAC CCTGATACAT CTCTATTCCT TCTCCAAAGC CTATCGGCTG
ACCGGCCATC GCGTGGGCGC GATCGTGGCT GCGCCCGAGC GCCTGCTCGA GATGGAGAAA
TTCCTCGATA CCGTCACCAT CTGTCCCGCG CAGATCGGCC AGATCGGCGC CAAATGGGGG
ATCGAGAACC TCGATGACTG GCTCGCTGGC GAGCGCGGCG AGATCCTCGC GCGGCGCGAC
GCCATTGCCG CCGGATTTGC TCCGCTGGCG GCAAAAGGCT GGCAGCTTTT GGGACTGGGG
GCCTATTTCG CCTATGTGGC GCATCCGTTC GCGGCAAGCT CCGACGAGAT AGCACAACGC
CTGCTTCACT CGGCTGGCAT GCTGCTTTTG CCGGGCACGA TGTTCACACC GGCTGGCGCC
CCCGAGGGCC ACCGCCAGTT CCGCATTGCC TTTGCCAATG TCGATCAGAC CGGCATTGCG
GAGATGCTCG CGCGACTGGC GCAGTTTGAC GGTTGA
 
Protein sequence
MHLSRTNATF VSPIVQSRRW LEDVTFSDDL PLLNLSQAAP ADPPPEGLRQ AMADALLSET 
SAHLYGPVLG NADLRKALAA QITRHYEADI TPSEVAITSG CNQAFAATIQ SLCAEGDEVI
LPTPWYFNHK MWLDMQGVTT LPLPVGDGML PEVAKAEALI TPRTRAIVLV TPNNPCGVEY
PAALMDAFFE LAQRHGLTLI VDETYRDFDS RDGAPHGLFS RKDWHETLIH LYSFSKAYRL
TGHRVGAIVA APERLLEMEK FLDTVTICPA QIGQIGAKWG IENLDDWLAG ERGEILARRD
AIAAGFAPLA AKGWQLLGLG AYFAYVAHPF AASSDEIAQR LLHSAGMLLL PGTMFTPAGA
PEGHRQFRIA FANVDQTGIA EMLARLAQFD G