Gene TM1040_3607 details

Gene Information

Locus tag: TM1040_3607
Symbol:
ID: 4075034
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 657267
End bp: 658217
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 62%
IMG OID: 638005126
Product: asparaginase
Protein accession: YP_611836
Protein GI: 99078578
COG category: [E] Amino acid transport and metabolism; [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0252] L-asparaginase/archaeal Glu-tRNAGln amidotransferase subunit D
TIGRFAM ID: [TIGR00519] L-asparaginases, type I
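
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The record states neither convention explicitly, and the variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 657267
end_bp = 658217
gene_length_bp = 951
protein_length_aa = 316

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp == gene_length_bp)                         # 951 True
print(expected_protein_aa, expected_protein_aa == protein_length_aa)  # 316 True
```

Under these assumptions the 951 bp span corresponds to 317 codons, that is, 316 amino acids plus one stop codon, which agrees with the listed lengths.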


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.143302
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATTT GCGTAATCCA CACCGGCGGC ACCATTGGCA TGGCCCCCTC GCCAGAGGGG 
TTTGCCCCCA AAACAGGAAT CGTGGAGGCG GAGCTGGACC GTTTGCAGCG CATCGGCGCA
ATAGAGGCTG ATTTCAGGGT TGTGACGGCA TCCCCTTTGA TCGACAGCGC CAACGCTACC
TCGGCGGATT GGAACTGGAT CATGGCGCAG ATCGCAGCGC ATGATGATGA TTGCGCGGGG
TTTGTGGTGA CACATGGCAC CGACACACTC GCCTTTACAG CCGCTGCTTT GTCCTTTGGT
CTCAAAGGGT TGCGCAAGCC GGTTGTGATC ACCGGAGCGA TGCTGCCACT TTCCGAGGAG
GGCAGCGACG GCAGCGACAA CCTCCGAGAC GCGTTTCGCG CGGTCGAACA GGCTGCGCCC
GGCGTCTGGG TCCAATTTGC AGGGAAGTTG CTGCATGGCG CGCGGGTGCG AAAATCGCAT
TCGGTGGCCT TTGACGCATT CAACGCATCA CCAACGGATG CCGCCCCCCT CCGGGCGGCC
GAAACTCTCG GCATTTTTGA ATACGGCGAT GCCACCGCGC TGATTGCGGC GGTGGCTCCG
GGAATGAATG CCTCCTTGAT TTCTTATGCG GTAGAACAGG CCGCGGGGAT TGTACTGCGC
TGTTACGGCT CTGGCACCGT GCCCGAGGGC CTGGGGCTGC GCAAAGCCAT GTTGCAAGCA
CGAGACACCG GCGTTCCGGT CCTCGCCGTG AGCCAATGTG CCGAGGGCGG CATTTCCCTT
GGTACTTATG CGGCGGGTGC AATGCTTGCG CAGACCGGAG CTATCGACGG GCGCGACATG
ACCGTCGAGG CGGCCTATGC CAAGCTGCTG CATGCGCTGT CGCAAAGCGC AGATCTGGCG
ACTCGGCGCG AGATCCTTGA AACCCGCCTA TGTGGAGAAT GGGCCTTGTA G
 
Protein sequence
MTICVIHTGG TIGMAPSPEG FAPKTGIVEA ELDRLQRIGA IEADFRVVTA SPLIDSANAT 
SADWNWIMAQ IAAHDDDCAG FVVTHGTDTL AFTAAALSFG LKGLRKPVVI TGAMLPLSEE
GSDGSDNLRD AFRAVEQAAP GVWVQFAGKL LHGARVRKSH SVAFDAFNAS PTDAAPLRAA
ETLGIFEYGD ATALIAAVAP GMNASLISYA VEQAAGIVLR CYGSGTVPEG LGLRKAMLQA
RDTGVPVLAV SQCAEGGISL GTYAAGAMLA QTGAIDGRDM TVEAAYAKLL HALSQSADLA
TRREILETRL CGEWAL
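
The relationship between the gene sequence, the protein sequence and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method. It assumes that translation table 11 assigns codons to amino acids exactly as the standard code does (the two differ in their permitted start codons), and the helper names `clean`, `gc_content` and `translate` are hypothetical. Only the first ten-base blocks of the gene sequence are used as sample input.

```python
# Codon table built from the usual compact layout (base order T, C, A, G).
# Assumption: table 11 shares these codon-to-amino-acid assignments
# with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace that separates the ten-base blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGACCATTT GCGTAATCCA CACCGGCGGC"
print(translate(first_blocks))  # MTICVIHTGG
print(f"{gc_content(first_blocks):.0%}")
```

The translated prefix matches the first block of the protein sequence. Running `gc_content` on the full gene sequence would be the way to compare against the GC content field.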