Gene TM1040_3732 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3732 
Symbol 
ID4075439 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp790000 
End bp791391 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content60% 
IMG OID638005252 
Productargininosuccinate lyase 
Protein accessionYP_611961 
Protein GI99078703 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.663919 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGACA AGACCTCGAA CCAGATGTGG GGCGGCCGCT TTGCCGCCGG ACCGGACGCG 
ATCATGGAGG CAATTAATGC CTCTATCGGG TTCGACCAGC GCATGGCAGC GCAGGATATT
GCTGGCTCTC GGGCTCATGC GGCGATGCTC GCCGCGACCG GTGTCATTAC GGATAATGAC
GCCGAGGCGA TCCGTGAAGG GCTGCTCACC GTTTTGTCAG AGATTGAAAG CGGCAGTTTT
CAGTTCTCTA CTGCGCTCGA AGACATTCAC ATGAATGTCG AAGCGCGCCT CAAAGAGATC
ATTGGCGAGC CTGCAGGTCG TCTGCATACA GGTCGCTCGC GCAACGACCA GGTCGCAACC
GATTTCAAAC TCTGGGTGCG CGACCAATTC GATGCCGCTG AAAAGGGTCT TCTCGCGCTG
ATCAAAGCGC TGGTCGATCA GGCCGAGGCT GGCGCGGATT GGGTGATGCC GGGCTTTACC
CATCTGCAAA CCGCGCAGCC GGTCACATGG GGGCATCACA TGATGGCCTA TGTGGAGATG
TTTGGCCGCG ACCTCAGCCG GGTGCGCGAT GCGCGCAAGC GCATGAACGA GTCGCCCCTG
GGTTCTGCGG CGCTGGCAGG GACTTCGTTC CCGATTGATC GCGAGATGAC CGCCAAGGCG
CTGGGGTTTG ATCGCCCGAC GGCCAATTCG CTCGATGCGG TGTCGGATCG TGACTTCGCG
CTGGAGTTCC TCTCGGTTGC CTCTATCTGC GCCATGCATC TGTCGCGCTT TGCCGAAGAA
CTGGTGATCT GGTCCTCGGC GCAGTTCCGC TTTGTGACGC TTTCGGATCG TTTCTCCACT
GGCTCCTCGA TCATGCCGCA AAAGAAAAAC CCAGACGCCG CCGAACTGAT CCGCGCCAAG
GTGGGACGGA TCTTTGGCGC TAACACGGCG CTGATGATGG TGATGAAGGG CCTGCCGCTG
GCCTATTCCA AGGACATGCA GGAAGACAAA GAGCAGGTCT TTGACGCCGC CGATAACTGG
ATGCTCGCAC TTGCTGCGAT GGAAGGCATG GTGAAGGACA TGACCGGCAA CCGCGAAAGC
CTTGCGGCCG CGGCGGGGTC CGGTTTCTCG ACGGCCACCG ATCTGGCGGA CTGGATGGTG
CGGGTCCTGA AAGTGCCGTT CCGGGATGCC CACCATGTGA CCGGCGCGCT CGTCGCGATG
GCCGAGGGCC GCGGCGTGGA TCTGCCGGAT CTGAGCCTTG AAGACATGAA GTCTGTGCAT
GAGGGCATCA CCGAGGATAT CTTTACCGTG TTGGGCGTGG AGAATTCAGT AAACTCGCGC
ATGTCTTACG GCGGCACCGC TCCCGCGCAG GTACGCGCGC AGGTGGCGCG TTGGAAAGAG
ATCTTGGGCT AA
 
Protein sequence
MTDKTSNQMW GGRFAAGPDA IMEAINASIG FDQRMAAQDI AGSRAHAAML AATGVITDND 
AEAIREGLLT VLSEIESGSF QFSTALEDIH MNVEARLKEI IGEPAGRLHT GRSRNDQVAT
DFKLWVRDQF DAAEKGLLAL IKALVDQAEA GADWVMPGFT HLQTAQPVTW GHHMMAYVEM
FGRDLSRVRD ARKRMNESPL GSAALAGTSF PIDREMTAKA LGFDRPTANS LDAVSDRDFA
LEFLSVASIC AMHLSRFAEE LVIWSSAQFR FVTLSDRFST GSSIMPQKKN PDAAELIRAK
VGRIFGANTA LMMVMKGLPL AYSKDMQEDK EQVFDAADNW MLALAAMEGM VKDMTGNRES
LAAAAGSGFS TATDLADWMV RVLKVPFRDA HHVTGALVAM AEGRGVDLPD LSLEDMKSVH
EGITEDIFTV LGVENSVNSR MSYGGTAPAQ VRAQVARWKE ILG