Gene TM1040_3434 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3434 
Symbol 
ID4075608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp458378 
End bp459988 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content64% 
IMG OID638004943 
ProductABC transporter related 
Protein accessionYP_611668 
Protein GI99078410 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.560851 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTTC TGGATATTTC CAACCTCTCG CTCTCGATCG GGGCGCTGCC GATCCTGCGG 
AATGTGACCC TGTCGATAGC GGCAGGCGAG ATCCTCGCGG TGACGGGCGA GAGCGGATCT
GGCAAATCGA TGACGGCTTT TTCCGTCATG CGCCTGTTGC CGCAGGCTGC CAAGATTGAC
GGCGCGATTT ACCTTGAGGA CGAAAACCTC TGCGCCCTCT CCGAAGACGA GATGTGCCAG
CGGCGCGGGC GTGACATCGG TATGGTGTTT CAAGAGCCCA TGACGGCGCT CAACCCGGTG
ATGACCATCG GGGCGCAGGT GATGGAGACG ATCCTCTTGC ACACCGACAC CGCCGAAGCC
GACGCCAAGG CCGAGGCCGC GCGCGTGTTG ACCCGGGTCG GCCTGCCGCC GGAGCGTTTC
CCGATGGATC GCTACCCGCA CGAGCTCTCT GGCGGACAAC GTCAGCGCGT TGTGATCGCG
ATGGCGATTG CGCTCAGGCC CAAGCTCTTG ATCGCGGATG AACCCACAAC CGCGCTGGAT
GTGACTACAC AGGCGCAGAT CCTTGATCTC TTGCGGGACC TGGTGCGCGA ATACGGGATG
GGGCTGCTCA TTATCACCCA TGACCTTGCT GTTGTGGCCG ATCTTGCCGA TCGGATCGTG
GTGATGCGCA AGGGTGAGGT GGTCGAGCGC GGCCCGACCC TGAGCGTGCT CACCCATCAA
CGCCACCCCT ATACGCGCAT GCTGTTTGAG GCGTCCGCTC ACAAGGTTGC GCTGCCTGCC
GCGCCCGAGG CCCCTGCCCC GCTGCTGGAG GTGCGAGGCG CCGTGCGCGA CTATGCAACG
CCTCGCAAAG GCTGGCTGGG CAAACCAGGC ACGTTTCGCG CCGTCAACGA TGTGAGCTTC
ACGCTTCACA AGGGCGAGCG TCTGGGCCTT GTGGGCGAGT CCGGATGCGG CAAATCCACC
CTGACCCGGG CCATTCTGGG CCTCGAAGGG CTGCAAGGCG GAGAGCTGCT GCTGAACGGT
CAGCCCCTGT CGCAACAGCG CAGCGCCGCC CAGCGGCAAG CGATGCAGGT GGTGTTTCAG
GACCCTTATG GCAGTTTCAA TCCGCGCCAC CGGGTTGAGC GGCTGGTGAC GGAACCGTTT
CATCTATCGC CGGACGCCGC AACCGGCCAA GAGCGGCGCG ACCTGGTGGG ACAGGTGCTG
GAGGACGTGG GGCTCAAGGC TTCGGATGCA AACAAGTATC CGCATCAGTT TTCCGGTGGG
CAACGTCAGC GCATCGCCAT TGCGCGCGCG CTGATTACGC GGCCCGAGCT CATCATCTTT
GACGAGGCGG TCTCGGCGCT CGATGTCTCG GTGCGCGCCC GTATTCTCGA CCTGATTGCC
GAGCTCTGCG AGGCCTATGG GCTCACCTAT CTCTTTATCA GCCACGATCT GAGCGTGGTG
CGCGAGGTCT GCGACCGGGT GCTGGTGATG CAGAAGGGCG AAATCGTCGA AGAAGGCCCC
GTCGACGACG TCTTTACCGC GCCACAGCAC AGCTACACGC AGAAACTTCT GGCGGCGGCG
CCGGTGCTGC CGGACATCAC GCCAACCGAG CCCGTTGAGG CCCCCGCATG A
 
Protein sequence
MSLLDISNLS LSIGALPILR NVTLSIAAGE ILAVTGESGS GKSMTAFSVM RLLPQAAKID 
GAIYLEDENL CALSEDEMCQ RRGRDIGMVF QEPMTALNPV MTIGAQVMET ILLHTDTAEA
DAKAEAARVL TRVGLPPERF PMDRYPHELS GGQRQRVVIA MAIALRPKLL IADEPTTALD
VTTQAQILDL LRDLVREYGM GLLIITHDLA VVADLADRIV VMRKGEVVER GPTLSVLTHQ
RHPYTRMLFE ASAHKVALPA APEAPAPLLE VRGAVRDYAT PRKGWLGKPG TFRAVNDVSF
TLHKGERLGL VGESGCGKST LTRAILGLEG LQGGELLLNG QPLSQQRSAA QRQAMQVVFQ
DPYGSFNPRH RVERLVTEPF HLSPDAATGQ ERRDLVGQVL EDVGLKASDA NKYPHQFSGG
QRQRIAIARA LITRPELIIF DEAVSALDVS VRARILDLIA ELCEAYGLTY LFISHDLSVV
REVCDRVLVM QKGEIVEEGP VDDVFTAPQH SYTQKLLAAA PVLPDITPTE PVEAPA