Gene CPS_1022 details


Gene Information

Locus tag: CPS_1022
Symbol:
ID: 3522939
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Colwellia psychrerythraea 34H
Kingdom: Bacteria
Replicon accession: NC_003910
Strand:
Start bp: 1045984
End bp: 1047558
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 38%
IMG OID: 637283487
Product: putative tryptophan halogenase
Protein accession: YP_267771
Protein GI: 71282464
COG category:
COG ID:
TIGRFAM ID:
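
The coordinate and length fields above are related by simple arithmetic, and a short check can confirm they agree. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length counts the stop codon. Neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1045984, 1047558)
print(length)                  # 1575
print(protein_length(length))  # 524
```

Under these assumptions both results match the Gene Length and Protein Length fields of the record.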


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGGCCT TAGATAATAA TCATATTCTT ATAATTGGCG GTGGCACTGC TGGTTGGTTA 
AGTGCGGCAA TACTTGCCAA AACGTTGAAT AGTAAAAATA CTGATGGTGT TAAAGTCACT
TTGGTTGAAT CGCCGACCAT TCCAATTTTA GGTGTTGGAG AAGGCACATG GCCAAACTTA
AGAGCGACAT TACATAAAAT AGGTATTAGC GAAACAGACT TCATTCGTGA ATGCGATGCG
ACCTTTAAAC AAGGTGCAGA GTTTATTAAT TGGTCTAAAA CGCCAGAGCC AAAACAATCA
CACAGTTATT ATCACCCACT CAGTACGGTT AGCCATTCTT CATACGATTT TAACTTAGCC
CCTTATTGGT TACAACAAGA TAAAAAAACA CGCTTACCTT ATGATAGAGC TGTTGCATCA
CAAGCAAGAG TTTGTGATGA AGGACTAGCA CCTAAACAAA TTGTGATGGC AGAATATAGT
GCCGCGCAAG AGTATGCCTA TCATTTAAAT GCGAATAAAT TGGCCGAGTT TTTAAAACGG
CATTGCGTTG AGAAACTTGG GGTTAAATTT GTCAGTGCCA ATGTCACCAA TGTAGCGCTA
GATAATGAAG ACTTTATCAC GCATGTAGAC ACTGACCATG AAAGTGAAAA GAAAATTTTC
GCTGATTTTT TTGTCGATTG TAGCGGTGCG AAAGGGTTAA TCATTAAAGA AACCTATAAC
ACAGCTTGGC AAAGTATTAG CGATGTTATT TTTAATGATA CCGCCTTAGC AGTACAAGTA
CCTTATGCTG ATAGAAATCA AAAAATAAAT ACCCATACTC TCGCAACAGC CCAAGAAGCA
GGTTGGATAT GGGATATAGG TTTACAGGAC CGTCGTGGAG TTGGTCATGT ATTTAGTAGC
AAGTACATCT CAGATCAAAA AGCCGAGCAA CAACTTATTG ATTATCTAGG GGATGATTAC
AGCGACGATT TGACTATTCG TAAAATAAAG CTCAACCATG GTTACCACAA GAAGTTCTGG
CATAAAAACA GTGTGGCTAT TGGTATGTCG GCGGGCTTTG TTGAGCCACT TGAAGCATCG
GCTATTTTCT TATTTGATGC CGCAGCTAAT ATGCTTGCAG CACAGTTTCC TCGTGATAAA
GCACAAATGA AATATGCTGA AGACAAATTT AATCAGCAAT TAACGATGCG TATGCAGCGT
ACGGTTGAGT TTATTAAATT GCATTACTGT ATTTCTCAAC GCCGAGATAG CCAATACTGG
ATTGATAACT GTGACCCAAT CAGTATCCCT GATAACTTAA AGCAACGACT GGCATTTTGG
CAAGGACAAG TACCAACCAA ATATGACTTT GAAAACGCTT GGGAACCCTT TAATTTAGAC
AGTTATCTTT ATGTTCTATA TGGTATGGGG TTTGAAACTG ATGTAGCTAA AGTTGCAGCT
AAATATACTG AAACAACTAA AGCTAAGCAC TTATTTAATA ATATTGATAA AGCCAGTGTG
CTGTTAATCG ATAAGTTACC TAAGCAAAGA GAGCTGATTG AAAAAGTAAT TAAATATGGG
TTTACTCAAG TATAG
 
Protein sequence
MMALDNNHIL IIGGGTAGWL SAAILAKTLN SKNTDGVKVT LVESPTIPIL GVGEGTWPNL 
RATLHKIGIS ETDFIRECDA TFKQGAEFIN WSKTPEPKQS HSYYHPLSTV SHSSYDFNLA
PYWLQQDKKT RLPYDRAVAS QARVCDEGLA PKQIVMAEYS AAQEYAYHLN ANKLAEFLKR
HCVEKLGVKF VSANVTNVAL DNEDFITHVD TDHESEKKIF ADFFVDCSGA KGLIIKETYN
TAWQSISDVI FNDTALAVQV PYADRNQKIN THTLATAQEA GWIWDIGLQD RRGVGHVFSS
KYISDQKAEQ QLIDYLGDDY SDDLTIRKIK LNHGYHKKFW HKNSVAIGMS AGFVEPLEAS
AIFLFDAAAN MLAAQFPRDK AQMKYAEDKF NQQLTMRMQR TVEFIKLHYC ISQRRDSQYW
IDNCDPISIP DNLKQRLAFW QGQVPTKYDF ENAWEPFNLD SYLYVLYGMG FETDVAKVAA
KYTETTKAKH LFNNIDKASV LLIDKLPKQR ELIEKVIKYG FTQV
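
The gene and protein sequences can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the database's own method: it builds the codon table for translation table 11 from the compact amino-acid string used in the NCBI genetic code listings (bases in T, C, A, G order), strips the 10-base grouping spaces, and translates up to the first stop codon. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
BASES = "TCAG"
# Amino acids of translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty DNA sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons are not handled; this gene starts with ATG.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGATGGCCT TAGATAATAA TCATATTCTT ATAATTGGCG GTGGCACTGC TGGTTGGTTA"
print(translate(first_line))  # MMALDNNHILIIGGGTAGWL
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the whole protein and the reported GC content.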