Gene TM1040_1223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1223 
Symbol 
ID4075931 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1313953 
End bp1315497 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content59% 
IMG OID638006531 
Productprotein of unknown function DUF853, NPT hydrolase putative 
Protein accessionYP_613218 
Protein GI99081064 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00822087 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGAGG AAATATTCGT CGGTGGCGGA GGGGACGCAT ATGGCGATCC ACAATACCTC 
ACGTTGAAAT ATGCCAACCG GCACGGTCTG ATTGCGGGCG CAACGGGAAC CGGTAAAACC
GTTACACTTC AAATCCTTGC CGAGAGTTTC TCGAATGAAG GCGTGCCGGT CATCTTGGCG
GATGTCAAAG GAGACCTGTC CGGACTGGCG CGCGCAGGCA GCGAAACGGC GGAGTTGCAT
GCGCCTTTCG TGAAAAGAGC ACAAAAAATT GGATTTACCT CTTTTTCCTA TCACGACACC
CCCGTCACCT TCTGGGATCT CTTTGCACAA CAGGGCCATC CGATCCGGAC CACGGTTGCC
GAAATGGGGC CTTTGCTGCT GTCGCGTCTG CTGGAACTCA GCGAAGCACA GGAGGGTATC
CTGAACATTG CCTTCCGGCT TGCGGATGAG CAGGGGCTGC CGCTGCTGGA TCTCAAGGAT
CTACAAGCGC TGTTGGTCTG GGTCGGCGAG AACCGCGAAA GCCTGTCCCT GCGCTACGGC
AATGTCTCCA CCGCTTCGAT CGGCGCCATC CAGCGCCGCC TGCTGGTTCT GGAGAACCAG
GGCGGGGCGC TGATCTTTGG CGAGCCGGCC TTGGATCTTG AGGATCTGAT GCGCTTTGAT
GCTGCGGGCC GGGGCATGGT GAACATTCTG GCCGCGGATA AATTGATGGC TTCTCCAAAG
CTTTACGCGA CGTTCTTGCT GTGGCTGTTG AGCGAGCTGT TCGAGAGCCT TCCTGAAGTC
GGAGATCCGG AAAAGCCCAA GCTGGTGTTC TTTTTCGACG AGGCGCATCT CCTGTTTGAA
GACGCACCCA AAGCCCTCAT CGACAAGGTG GAACAGGTCG CACGGCTGAT CCGCTCCAAG
GGGGTTGGGA TCTATTTTGT CACCCAGTCT CCGGACGACA TTCCTGAGGA TATTCTGGGG
CAGTTGGGCA ATCGCATCCA ACATGCGCTG CGTGCGTTTA CGGCGCGGGA TCAGAAGAAG
CTGAAGCTTG CCGCAGAAAC CTATCGTGCA AACCCGCGTT TTTCGACCGA AGACGCCATC
CGTGAGGTCG GCGTCGGCGA GGCGGTGACC TCCATGCTCG AGAAAAAGGC CGTACCTGGC
GTGGTGGAGC GGACGCTTAT TCGCCCGCCC TCGAGCCAGC TTGGACCGAT CACCGAAGAG
TTCCGCAGGA GTGTAATGCA AGCGTCTGAT ATGGCGGGAA AATATGACAA GTCTGTTGAT
CGCCATTCAG CCTATGAAAT CCTGAAAGAG CGGGCGGACA AAGCCTCGAG AGAAGCGGCA
GACGCCGAGG CGCAAGCCGA AACAGCGCCA GATCCGGTGG TGCGCGAGTT CAGCGCCGCG
CGACGGTATA GCGGCAGTCG CGTGGGGCGA TCCACCTCGC GCCGGATCGG CGGCGGTGAC
ACTTTTGCCT CCGCCATGTC CGAGTCGGTG ATCAAAGAAC TGAAAGGCAC CACCGGGCGG
CGCATCGTTC GCGGGATTCT GGGCGGGCTC TTCAAGGGGC GCTGA
 
Protein sequence
MAEEIFVGGG GDAYGDPQYL TLKYANRHGL IAGATGTGKT VTLQILAESF SNEGVPVILA 
DVKGDLSGLA RAGSETAELH APFVKRAQKI GFTSFSYHDT PVTFWDLFAQ QGHPIRTTVA
EMGPLLLSRL LELSEAQEGI LNIAFRLADE QGLPLLDLKD LQALLVWVGE NRESLSLRYG
NVSTASIGAI QRRLLVLENQ GGALIFGEPA LDLEDLMRFD AAGRGMVNIL AADKLMASPK
LYATFLLWLL SELFESLPEV GDPEKPKLVF FFDEAHLLFE DAPKALIDKV EQVARLIRSK
GVGIYFVTQS PDDIPEDILG QLGNRIQHAL RAFTARDQKK LKLAAETYRA NPRFSTEDAI
REVGVGEAVT SMLEKKAVPG VVERTLIRPP SSQLGPITEE FRRSVMQASD MAGKYDKSVD
RHSAYEILKE RADKASREAA DAEAQAETAP DPVVREFSAA RRYSGSRVGR STSRRIGGGD
TFASAMSESV IKELKGTTGR RIVRGILGGL FKGR