Gene Cyan8802_1395 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_1395 
Symbol 
ID8390707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp1427291 
End bp1428880 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content43% 
IMG OID644979398 
Producttryptophan halogenase 
Protein accessionYP_003137148 
Protein GI257059260 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAACAA CAACATCAAG ATCCCCTCAA GCTATTGAAA ACGTATTAAT TGTCGGAGGG 
GGGACTGCCG GCTGGATGAC GGCAACTTAC CTTAATAAGG CATTTGGACC TCAAGTCAAA
GTAACCCTGA TGGAATCTCC TAGTGTTCCA CGTATAGGCG TAGGAGAGGC AACTGTACCC
AACTTACAAA GGACTTTTTG GGACTTCCTA GGCATTCCTG AACGGGAGTG GATGAAAGAA
GTCAATGGAG CTTTCAAAAC AGCAGTTCGC TTTGTCAATT GGCGAAAACC CAAATCAGGA
GAAGGGGTTA ATCACTTCTA CCATCCCTTT GGCATTTTGC CTAACCTTGA GGGGGTTCTT
CTGCCTCACT ACTGGTATCA CCTGACTAGA GGCACAGATC CAGTTGATTA TTCCTGTTTC
CGTGAACCTC CTTTGATGGA CGCGAAAAAA GCCCCTGTTT ATAGGGATGG TACCTCCGCC
GTACCCCATG CTTGGCACTT TGATGCCCAT CTGGTGGCTA AATTCCTGAG TAACTGGGGT
AAAGAACGGG GCGTTGTACA TATCTTAGAT TATTTAGAAA ATGCTACCCT CGATGAGCAA
GGCAATATTG CCTCTATTCA AACCCGAAAT GGATTAACCT TAGAAGCCGA TCTGTTTATC
GACTGTACTG GATTTCGTGG CTTGTTGATC AATAAAGCTT TGAATGAACC CTTTATTGAT
ATGAATGATC ACTTGCTCTG TGATAGTGCA GTGGCTGCTG CTATTCCTTC TAATGATGAA
AGGGATGATA TTGAACCTTT TACCAGTGCT TTTGCCCAAG AAGCCGGTTG GATTTGGAAA
ATTCCCATGA TGGGGCGTTT TGGTTCAGGC TATGTTTATT GTAGTCAGTT TCTCAGCGAA
GACGAAGCAG CAACCAATTT CTGCAAGTTT TGGAACGTTG ATGAATCGAA AACGAATCTC
AACCGTATTC GCTTTAGAAC CGGTCGCAAT CGCCGAGCTT GGGTCAAAAA CTGCGTTAGT
ATTGGACTTT CTTCGTGTTT CTTAGAGCCT TTGGAGTCCA CAGGAATTTA CTTTATCACG
GGTGCGATTT ATCAGTTGGC CAAGTATTTT CCTAGCAAGC AGATGGAACC CGCTTTACGA
GATAAATTCA ATGAAGAAAT TGAATTTATG TACGACGACT GTCGGGACTT TATTCAAGCT
CACTACTTAA TCACAACACG AGATGATAGT CCTTTTTGGT TAGCTAATAA GCACGAACTG
ACTATGAGTG ACTCGATTAA AAACAAGCTT GAACTCTATA AAGCGGGACT GCCCGTTTCT
CCCTTGCCTT CGAGTGAAAA GGATTATTAT GCTAACTTAG ATAACGAATT TCATAACTTT
TGGACTGATG GTAGCTACTA TTGCATCCTA TCTGGTTTGG GCTGTTTTCC TGAACAATCC
CATCCCTATC TTCGGGATCA TCCAGAAACC GTTAGAGAGT CAGTTGAAGT TTTTACTAAG
ATTAAGGAAC AGCAACAAGA ATTATTAGAA GAGTTGCCGA GTAATTATGA ATACCTCAGA
CAACTTCATA AAGTTGATCA TCTGGTCTAA
 
Protein sequence
MQTTTSRSPQ AIENVLIVGG GTAGWMTATY LNKAFGPQVK VTLMESPSVP RIGVGEATVP 
NLQRTFWDFL GIPEREWMKE VNGAFKTAVR FVNWRKPKSG EGVNHFYHPF GILPNLEGVL
LPHYWYHLTR GTDPVDYSCF REPPLMDAKK APVYRDGTSA VPHAWHFDAH LVAKFLSNWG
KERGVVHILD YLENATLDEQ GNIASIQTRN GLTLEADLFI DCTGFRGLLI NKALNEPFID
MNDHLLCDSA VAAAIPSNDE RDDIEPFTSA FAQEAGWIWK IPMMGRFGSG YVYCSQFLSE
DEAATNFCKF WNVDESKTNL NRIRFRTGRN RRAWVKNCVS IGLSSCFLEP LESTGIYFIT
GAIYQLAKYF PSKQMEPALR DKFNEEIEFM YDDCRDFIQA HYLITTRDDS PFWLANKHEL
TMSDSIKNKL ELYKAGLPVS PLPSSEKDYY ANLDNEFHNF WTDGSYYCIL SGLGCFPEQS
HPYLRDHPET VRESVEVFTK IKEQQQELLE ELPSNYEYLR QLHKVDHLV