Gene TM1040_1144 details


Gene Information

Locus tag: TM1040_1144
Symbol:
ID: 4078440
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1230946
End bp: 1232781
Gene Length: 1836 bp
Protein Length: 611 aa
Translation table: 11
GC content: 63%
IMG OID: 638006448
Product: putative peptidyl-prolyl cis-trans isomerase D
Protein accession: YP_613139
Protein GI: 99080985
COG category:
COG ID:
TIGRFAM ID:
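
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1230946, 1232781)
print(length_bp)                  # 1836
print(protein_length(length_bp))  # 611
```

Both values agree with the Gene Length and Protein Length fields of this record.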


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.186687
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0843243
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGC TTTCAAAGAC CTTCGTTTGG ATCCTCATGG GTCTGTTGTT CGTGGGGCTT 
GCGGGCTTTG GTGCGATCAA CGTATCCGGC ACCACCCGCA CGCTGGCCAC CGTGGGTGAT
GCCACCGTCA GCGTTGATGA CTACGCCCGC GCCCTGCAAC AGGAACAACG TGCCATTCAG
GCCCAGTCCG GCCAGTCCAT TCCCCTGTCG CAGCTGATCG CCATGGGCGT GGATCGCGGT
GTGCTCTCGG GGCTTGTGGC TACGGCCGCG CTCGACAATG AGCTCACAGA GATCGGCCTG
TCTGTGGGCG ACGAAGTGCT GCTCGAAGAG ATCACCCGGA TTTCGGCCTT CCAGGGGTCT
GATGGCAGCT TCAGCCGCGA CAACTACCGT TTTGCCCTTG ATAACGCAGG TCTGTCCGAG
GCTGAGTTCG AGGCCGACAT GCGGTCTGAA ACCGCGCGCA CGCTGCTGCA AGGGGCCATC
ACCGGTGGCA CCCGCATGCC CGCGGTGCTG GGCGAGACAC TCACCAACTA TATCGGTGCG
CGTCGCAGCT TTTCCTATGT TCGGCTCGCC GAGGCCAACG TCCCCCTGAC CGCGATCGCG
CCCACCGAGG ATGCGCTCAA GGCTTATTAC GAGACCCACA TCGAGGATTT CACCCTGCCC
GAGACCAAGG TGATCACCTA TGTCGCCCTG CGCCCCGACG CGCTGGTGGA TCAGGTCGAG
GTGGATGAAA CCGCTCTGCG CGGTCTCTAC GAGGAGCGCG ACGCAGAATT CAACGTCCCC
GAGCGCCGCC TTGTCGAACA GCTGGTGTTC CCCGACGAGG ACGCCGCCCA AACCGCCAAG
GCCGAGCTCG ATGTGGGGGG GACCACCTTT GAAACCCTGG TGGCAGATCG CGAACTCTCC
CTGTCGGACA TCGACATGGG TGATGTGGCG ATTGGCGACC TTGGCGCTGC GGGCGAAGCC
GTCTTTGCCG CCGACGTGGG CGCCGTGGTT GGCCCGCTTC CATCAGATTT TGGCCCCGCG
CTCTACCGTG TGAACGGCGT GCTCGAAGCG AATTTCACCA GCTTTGAAGA GGCCCTGCCG
GTCTTGCGCG AAGAGCTGGC TCTGGATCGT GCCCGCCGCC TGATTGAATC CCGCGCCGAA
GGGCTGGATG ATGCGCTCGC CGGTGGCGCG ACGCTCGAAG ATCTCGCCGA GGAAGAAAAC
GGCGTGGTGC TCGGCCAGCT TGACTGGACC GCCAATACCA CTGACGACAT TGCGGCCTAT
GACACCTTCC GCAACGCGGC AGAGGCCGTG ACGACCGAAG ATTTCCCCGA AGTGGGCTAT
CTCGAGGATG GCAGCCTCTT TGCGTTGCGC CTCGATGAGG TCCTGCCCCC GCGCCCCGAG
CCCTTTGAGG ACGCGCGCAC AGCCGTTTCC GAAGCCTACG AGGGCGACCG GATCGCCAAG
GCGCTTGCCG CGCAGGCCGA ACAGATCAAA ACCGCGACCG AGGGCAACAA CGGCCAGTTC
CCCGAAGGTC GCGAAGTGAT CGAGCAGACC GGCCTTACCC GGACCGCCTA TCTCGATCAG
ACCCCGGTTG ATCTCCTCAA TCAGGTGTTC GAAATGGAAG TGGGCGAAGT GCGTGTTGTC
TCCGACGCAG AAGGCACTGT TGTGGTGCGT CTCGACGAGA TCCTGCCCCC GGATGACGGC
AATGACATGC AGTTCCTGTC TCAGGCGCTC GGCGATCAGC TGAACCAGTC GCTCTCCAAC
GAGATCTTCC AGATCTATCT GCAATCGGTT CAGACCCGCG CCCGTCCGCA GGTCAACCAA
CAGGCCGTGA ATGCTGTTCA CGCCAACTTC CAGTAA
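
The gene sequence is listed in blocks of ten bases separated by spaces. The following is a minimal Python sketch of how such a listing could be cleaned up and its GC content computed. The helper names are hypothetical, and the input used here is only the first line of the gene, so the printed percentage is not expected to match the 63% reported for the full gene.

```python
def clean_sequence(listing: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence listing."""
    return "".join(listing.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return gc / len(sequence)


# First line of the gene sequence listing only (illustrative input).
first_line = "ATGAAAAAGC TTTCAAAGAC CTTCGTTTGG ATCCTCATGG GTCTGTTGTT CGTGGGGCTT"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC content of this fragment only
```

Passing the whole listing to clean_sequence should yield a string whose length can be compared against the Gene Length field.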
 
Protein sequence
MKKLSKTFVW ILMGLLFVGL AGFGAINVSG TTRTLATVGD ATVSVDDYAR ALQQEQRAIQ 
AQSGQSIPLS QLIAMGVDRG VLSGLVATAA LDNELTEIGL SVGDEVLLEE ITRISAFQGS
DGSFSRDNYR FALDNAGLSE AEFEADMRSE TARTLLQGAI TGGTRMPAVL GETLTNYIGA
RRSFSYVRLA EANVPLTAIA PTEDALKAYY ETHIEDFTLP ETKVITYVAL RPDALVDQVE
VDETALRGLY EERDAEFNVP ERRLVEQLVF PDEDAAQTAK AELDVGGTTF ETLVADRELS
LSDIDMGDVA IGDLGAAGEA VFAADVGAVV GPLPSDFGPA LYRVNGVLEA NFTSFEEALP
VLREELALDR ARRLIESRAE GLDDALAGGA TLEDLAEEEN GVVLGQLDWT ANTTDDIAAY
DTFRNAAEAV TTEDFPEVGY LEDGSLFALR LDEVLPPRPE PFEDARTAVS EAYEGDRIAK
ALAAQAEQIK TATEGNNGQF PEGREVIEQT GLTRTAYLDQ TPVDLLNQVF EMEVGEVRVV
SDAEGTVVVR LDEILPPDDG NDMQFLSQAL GDQLNQSLSN EIFQIYLQSV QTRARPQVNQ
QAVNAVHANF Q