Gene TM1040_3000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3000 
Symbol 
ID4078030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp3167703 
End bp3169283 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content57% 
IMG OID638008329 
Producthypothetical protein 
Protein accessionYP_614994 
Protein GI99082840 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.592771 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.37885 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGACGG CTCAGGAACT ACCGATTGAT ACAACCGCCT CTGCCGAGGA CATGGCGGAA 
GCCATTTTTG GCAATGGCAT CGAGATTCAA TCCGCGACCT ACACCGGCGC TAACGGTGCG
TCCGGCATCT ACAGCGGCGG TGATTCAATC ACGCCGGGTG TGACCCCATC GGACACTGGC
GTTATCCTGT CCACTGGCAA CGCAACGGAT TTCACCAATA GCTCGGGCAG CACGGATACC
AATACCTCTG CCGGGACATC CACCAATCAC GGAACCGCGG GCGATACGGA TCTGGATGCG
ATCGCAGGTG GAACGACCTA TGATGCCGCC GTCTTTGAGG CCGAATTTGT GCCCGAAGGC
TCCACACTGA CGATGCAGTT CGTCTTTTCC TCCGAAGAGT ATCTCGAATA CGTCAATTCA
GGCTTCAACG ATGCGGTGTC GGTCTGGGTC AATGGTGTTG AGGCGGAACT GGTAGTGGGC
GATGGCAATA TCGCCATCGA CAACATCAAT GATACGTCCA ATTCGGATCT CTACGTCGAC
AACCCCGCGG ACAGTGATCT CTACAACACT GAAATGGACG GGTTTACCGT CACGTTGAAG
CTGACAGCGC CGGTCAACCC AGGCGAAACC AACCACATCA AGATTGCCAT CGCCGATACA
GGAGATGCCG CTTACGATTC CAACCTGCTG ATTGCCGGGG ACTCGGTGCA GACGGCGTTG
GTTGCGGAAA ACGATCAGGT GCTGGTCTGG GAGGGCTCAT CCACGACCGT TGACGTGCTT
GCCAATGATG CGCAAGGGTC AGGCGCTCAG CTCACCATCA CACATATCAA CAACCAGCCT
GTCACGGCTG GCAGTTCGGT GACATTGGCA AATGGCGAAA TCATCACCCT CAACGAAGAC
GGCACCCTCG AGATCACAGG CCAGCCCGAC GTGGATACGG ATGAGACGTC CGTCTTCACC
TACACGGTGG CCGATGATGA AGGAAATACC GACGTTGCTT TTGTCGATCT GACCACCACG
CCCTGCTTTG TCGAAGGTAC CTTGATCGAT ACAACCGAAG GCCCGTGCCC GGTCGAGGCG
TTGGAAATTG GCATGACCGT CGTGACCCGT GACCACGGTG CTCAGCCGCT GCGCTGGATT
GGGCGCAGCA TTCGTACGGC GTCGGGCAAT GATGCGCCCG TTCGGATTGC CGCAAACACT
CTGGGCCAGC ACCAGGAGAT CGAACTCTCG CCGAGCCATC GAGTGCTATT GTGCTCTGCC
GCCGCCGAGA TGCTCTTTGG TCAAGAAGAA GTTTTGATCC CGGCGCATCA TCTGGTAAAT
GACAGCACCA TCCGGCGTCG CAGTGACGGG CGGCGCGTTA CCTATTTCCA CCTGCTGTTC
GACCAGCATG AAATCATTCG TGGCAATGGG CTTGAGAGTG AGAGCTATCA TCCGGGTGCC
GCGAGCCTCG GTGGATTGGA TGCAGAGACA CGGGCGGAGT TCTTCGATCT GATGGGCGAC
AGCTGGCAGA GCTATCAGAA TATGAAACGC CCGACACTCA AATCATATGA GACCCGCGCG
TTGCTGGGGG CCGTTCAGTA A
 
Protein sequence
MPTAQELPID TTASAEDMAE AIFGNGIEIQ SATYTGANGA SGIYSGGDSI TPGVTPSDTG 
VILSTGNATD FTNSSGSTDT NTSAGTSTNH GTAGDTDLDA IAGGTTYDAA VFEAEFVPEG
STLTMQFVFS SEEYLEYVNS GFNDAVSVWV NGVEAELVVG DGNIAIDNIN DTSNSDLYVD
NPADSDLYNT EMDGFTVTLK LTAPVNPGET NHIKIAIADT GDAAYDSNLL IAGDSVQTAL
VAENDQVLVW EGSSTTVDVL ANDAQGSGAQ LTITHINNQP VTAGSSVTLA NGEIITLNED
GTLEITGQPD VDTDETSVFT YTVADDEGNT DVAFVDLTTT PCFVEGTLID TTEGPCPVEA
LEIGMTVVTR DHGAQPLRWI GRSIRTASGN DAPVRIAANT LGQHQEIELS PSHRVLLCSA
AAEMLFGQEE VLIPAHHLVN DSTIRRRSDG RRVTYFHLLF DQHEIIRGNG LESESYHPGA
ASLGGLDAET RAEFFDLMGD SWQSYQNMKR PTLKSYETRA LLGAVQ