Gene TM1040_0938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0938 
Symbol 
ID4077566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1001097 
End bp1002125 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content61% 
IMG OID638006241 
Productagmatinase 
Protein accessionYP_612933 
Protein GI99080779 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family 
TIGRFAM ID[TIGR01230] agmatinase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.738745 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.133276 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAGCC CGCCTGCATT GGCTAGGACG TCAGCAGAAC CCGAGAACAG GGAGCAGTCC 
AAAATGGCGC TGGAAGACGC AAAAACGCAG GTAGATCAGG CATTTACGCG CGAGGACCTT
AAGGGTCTCA GCTTTGAGAA CACCTTTGGA GGGGCCACGT CGTTTCTGCG GCGTCGCTAC
ACCAAGGATC TGACCCACGC GGATATCGCC GTGACGGGTA TTCCGTTTGA TCAGGCGGTG
ACAAACCGCC CCGGCACACG GCTGGGTCCG CGCGCGGTGC GCGAGGCCTC GGCCCTGCAA
AGCCCCGATG CGCCTTATGG CTGGGATATT TGCCCTGCAA GCGAGCTGGC AATCGTGGAC
TACGGCGATC TGGCCTTTGA CTATGCCAAT GTCCCGGCGT TTCCCGACAC GCTCACGGAT
CATATCCGGG GCATTCTGGC CACCGATACA GCATCGGTGG CGATTGGCGG CGACCATTAT
GTGAGCTTCC CGATCCTCAA GGCCTATGCC GAGAAATACG GGCCTGTTTC GCTCTTGCAT
TTCGATGCTC ACAGCGACAC CTGGGCGGAT GATGATTTCA GCCGCGTGGA TCATGGCACG
ATGTTCTACA AAGCGGTGAA ATCCGGCATC ATTGACCCGG CTACCTCGGT GCAGGTGGGC
ATTCGCACCA CCAATGAGGA CAATCTGGGC GTGCCGACTA TCGATGCCCC CACGGTGCAT
GAGATCGGCC CGGTCGAGAC CGCGCGCCGC ATCCGCGAGG TGCTGGGCGA CCGGCCAGTC
TATCTGACCT TTGACATTGA CTGCCTCGAT CCGGCCTATG CGCCGGGCAC TGGCACCCCG
GTCTGGGGTG GCCTCACCTC TGCGCAGGCA GAGGCGATCC TCAAGGGGTT GCGCGGCATC
AACATTCTGG GCGGCGACGT CGTCGAAGTC TCGCCCCCCT TTGACACCAC CGGCGCCACC
GCCATCGCGG GCGCCCATGT GGCCATGAGT ATTATCTGCC TTCTGGGCTG GAGGATGACA
GGACGATGA
 
Protein sequence
MTSPPALART SAEPENREQS KMALEDAKTQ VDQAFTREDL KGLSFENTFG GATSFLRRRY 
TKDLTHADIA VTGIPFDQAV TNRPGTRLGP RAVREASALQ SPDAPYGWDI CPASELAIVD
YGDLAFDYAN VPAFPDTLTD HIRGILATDT ASVAIGGDHY VSFPILKAYA EKYGPVSLLH
FDAHSDTWAD DDFSRVDHGT MFYKAVKSGI IDPATSVQVG IRTTNEDNLG VPTIDAPTVH
EIGPVETARR IREVLGDRPV YLTFDIDCLD PAYAPGTGTP VWGGLTSAQA EAILKGLRGI
NILGGDVVEV SPPFDTTGAT AIAGAHVAMS IICLLGWRMT GR