Gene TM1040_1421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1421 
Symbol 
ID4078051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1516976 
End bp1518391 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content59% 
IMG OID638006731 
Producttetratricopeptide TPR_2 
Protein accessionYP_613416 
Protein GI99081262 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0844206 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0579144 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGTAG AAAACAGCCA TACTCCCGAT GCCTTGGCTG AGGCATTTGC AACCGTCGAC 
GGTCTTATGG CCACGGGCAA ATTTGCACGC TCTCTCAGCG TGATGCTGCC TTTTGTTCAA
ACAGGTGAGC TGGATGCTGG CCTGCTGGAT CGCACCGCCG ATTGCTTCTT TGAAATGCGT
GACTACGAGA ATGCGGTAAA CGTCATGCGC CATGTCACTG CAACCTGGCC CGATGATCCC
TCCGCATGGG GCAAGCTTGG CCTGATGCTG CAAACCAAGG GTGATCTCCT TGGTGCCGAG
CAGGCCTTTG AGGAAGTGCT GAAGCGCGAT CCCAATTCGA TCCCGGCGCT GACTGCGCTC
AACGGGATTG AGACATTTTC CGTCGACAGC CTCTATGCCC AGCGCCTGAT GTCGCTCTCC
GAGCGTGAGG ATCTGGATGC CAAACACCGT GCCTTGATCC ACTATGCGCT GGCCCAGATC
GCCCATGGGT CGGGCGAGGC AGAGGTCGCA TTTACGCTGT TCCAATCTGC GCGCAACGAG
GTGGCCGGTC CTTTTGCGCC TGCGATTTTC GACGAGATGG TCGCCGAGCA AGAAGCCTTG
TTTGAACCGC GCCCGGCCTC TGAGGATGCG TCGCTGCTGC CCAAACTGGT CTTTATCGGC
GGTATGCCGA TGTCTGGCAC CGGTCTGGTG GATCGAATTC TCGCACAGCA CCCTGGCGTG
TTCAGTGTTG GCGCAAAAAC GGCGCTCTCG CGCACCCATG GTGCCATCCG CATGCATCTC
GCCAAGACCG ACCGTCCCTG CAACTACTGG GATTGGATGG AGCATCTGAG CGCAGAAGAA
ATCGACATCT TCCGTCAGTA CTACCTCGAG CGCGCCCTCG GCGGTCAGGT GGCTGGTGGC
AAGACCATTG TTGACGCGCA TCCGCTGGGA TGTCTGGAAT TCGGCCTTGC ACAATTCCTC
TTCCCCGAGG CCAAATTTGT CTTCATGTCG CGTCACCCGA TGGACACGGC GCTGGCCAAT
ATCGCGTCCA ACGTGCTGAA CGGCAATGCG CTGGCATCGC GTACCGAATG GATCGCGCAG
GTGATGCGCA CGGTCTATTC CTCGGCAACC GTCTATGCTT CCAAACTTGG CGACAGCATG
CGCCTGCAGT CCTACGAGGC GCTGGTGCAA AACTCCGAGC GTGAAATCGG CCTCCTCTTG
GAGCATGCGG GTCTTGAATA CAACGCCGCA TGCCTCACAC CAAATGCGCT CTGTGACGTG
CCGCAGATTG CGACCATGCT TGGTCAGGAA GAGCTCTCGA CTGAGACGCA GAACCAGTGG
CTCCCCTATG AGGAGCAGCT GCAGGGGTTC TACGAGCAAC TGGGCGGCGA ACGCTGGGTC
TCTGCTTGGG AAGATTTCGA CAAGACGCTG CGCTGA
 
Protein sequence
MLVENSHTPD ALAEAFATVD GLMATGKFAR SLSVMLPFVQ TGELDAGLLD RTADCFFEMR 
DYENAVNVMR HVTATWPDDP SAWGKLGLML QTKGDLLGAE QAFEEVLKRD PNSIPALTAL
NGIETFSVDS LYAQRLMSLS EREDLDAKHR ALIHYALAQI AHGSGEAEVA FTLFQSARNE
VAGPFAPAIF DEMVAEQEAL FEPRPASEDA SLLPKLVFIG GMPMSGTGLV DRILAQHPGV
FSVGAKTALS RTHGAIRMHL AKTDRPCNYW DWMEHLSAEE IDIFRQYYLE RALGGQVAGG
KTIVDAHPLG CLEFGLAQFL FPEAKFVFMS RHPMDTALAN IASNVLNGNA LASRTEWIAQ
VMRTVYSSAT VYASKLGDSM RLQSYEALVQ NSEREIGLLL EHAGLEYNAA CLTPNALCDV
PQIATMLGQE ELSTETQNQW LPYEEQLQGF YEQLGGERWV SAWEDFDKTL R