Gene Strop_1108 details


Gene Information

Locus tag: Strop_1108
Symbol:
ID: 5057555
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 1253187
End bp: 1254545
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 76%
IMG OID: 640473375
Product: N-formimino-L-glutamate deiminase
Protein accession: YP_001157957
Protein GI: 145593660
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID: [TIGR02022] formiminoglutamate deiminase
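
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1253187, 1254545)
print(length_bp)                  # 1359
print(protein_length(length_bp))  # 452
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of the record.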


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCCGCT GGCTCGCCGA GTACGCGTGG CTCCCCGAGC AGCCCGAGCC GACCCCGGAC 
GTGCTGATCG AGACCGCTGC CGGCCGGATC ACCGGGGTGA CCCCGCTCGC GCCCGAAAGC
CGGCCGACCA CCGGGATCGA GGTCCTCGCC GACGCGGTCC GCCTGCCCGG GCTGACCCTG
CCGGGGCTGG CCAACGCGCA CTCGCACGCC TTCCACCGCG CGTTGCGCGG CCGCACCCAC
GGCGGTCGCG GCGACTTCTG GACCTGGCGG GACCGAATGT ACGAGGTGGC CGCCCGGCTG
GACCCGGAGA GCTACCTCGC GCTCGCCCGC GCCGGGTACG CGGAGATGGC GCTGGCCGGC
GTCACCTGCG TCGGCGAGTT CCACTACCTG CACCACGGCC CGGACGGCAC CCCGTACGCG
GACCCGAACG CGATGGGGGC CGCCCTGGTC GAGGCGGCAG CGCACGCCGG GATCCGGCTG
ACCCTGCTGG ACGCCTGCTA CCTGACCGCC ACCGTCACCG GCGATCCGCT GGCCGGGCCG
CAGCGACGCT TCGGCGACGG TGACGCCCTG CGCTGGGCGG AGCGGGCGGC GGCGTTCGCC
CCCACCGAGG CGCACGTACG GGTCGGCGCG GCGATCCACT CGGTACGCGC CGTGCCCGCC
GACCAACTGG CGACGGTGGC CGGCTCGGCG CAGGAGCGGG GCGTCCCGCT GCACGTGCAC
CTCTCCGAGC AGCCGGCCGA GAACGACGCC TGCCGGGCCG CGCACGGCTG CACCCCCACC
CGCCTGCTGG CCGACCGGGG CGTCCTCGAC CAGCACACCA CCGCCGTGCA CGCCACCCAC
CCCACCAGCT CGGACGTGGC CCTGCTCGGG GAGAGCAACA CCGGGGTCTG TCTCTGCCCC
ACCACCGAGC GGGACCTCGC CGACGGGATC GGACCGGCCC GCCGGATGGC CAACGCCGGC
ACCCCGCTGA GCCTCGGCAG CGACAGCCAC GCGGTGGTGG ACCTTTTCGA GGAGGCGCGC
GCGGTGGAGC TGGATGAGCG CCTGCGCACC CGGCAACGCG GCCACTTCAC CGCCGGCGAG
CTGGTCACCG CGGCCACCGT CGCCGGGCAC GTCGCCCTCG GCTGGGGCGA CGCCGGCCGG
CTGGCCGTCG GCGACCGGGC CGACCTGGTC ACCCTCCGGC TGGACAGCCC GCGGACCGCG
GGCGTACCGG CAGCCGGCGC GTTCTTCGCC GCCACCGCGG CGGATGTCCG CCAGGTGGTG
GTGGACGGCC AGGTGGTGGT CCAGGACGGA CTGCACTGCA CCGTCGACGT CCCCACCGAG
CTGGCCACGT CGATCGCGGA GGTGACCGGT ACGCCATGA
 
Protein sequence
MTRWLAEYAW LPEQPEPTPD VLIETAAGRI TGVTPLAPES RPTTGIEVLA DAVRLPGLTL 
PGLANAHSHA FHRALRGRTH GGRGDFWTWR DRMYEVAARL DPESYLALAR AGYAEMALAG
VTCVGEFHYL HHGPDGTPYA DPNAMGAALV EAAAHAGIRL TLLDACYLTA TVTGDPLAGP
QRRFGDGDAL RWAERAAAFA PTEAHVRVGA AIHSVRAVPA DQLATVAGSA QERGVPLHVH
LSEQPAENDA CRAAHGCTPT RLLADRGVLD QHTTAVHATH PTSSDVALLG ESNTGVCLCP
TTERDLADGI GPARRMANAG TPLSLGSDSH AVVDLFEEAR AVELDERLRT RQRGHFTAGE
LVTAATVAGH VALGWGDAGR LAVGDRADLV TLRLDSPRTA GVPAAGAFFA ATAADVRQVV
VDGQVVVQDG LHCTVDVPTE LATSIAEVTG TP
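
The gene begins with GTG, yet the protein begins with M. This is consistent with translation table 11, in which GTG can serve as an initiation codon and is then read as methionine. The following is a minimal sketch of that translation step, not the database's own pipeline: the helper names are hypothetical, only three of the table 11 start codons are listed, and the codon assignments used are those of the standard code, which table 11 shares.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS; whitespace between 10-base groups is ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon terminates translation
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("GTGACCCGCT GGCTCGCCGA GTACGCGTGG"))  # MTRWLAEYAW
print(translate("ACGCCATGA"))                          # TP
```

The two printed fragments match the first ten and last two residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.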