Gene Cyan7425_1230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan7425_1230 
Symbol 
ID7287151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 7425 
KingdomBacteria 
Replicon accessionNC_011884 
Strand
Start bp1067903 
End bp1069357 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content55% 
IMG OID643584236 
Productanthranilate synthase component I-like protein 
Protein accessionYP_002481970 
Protein GI220906659 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR01824] aminodeoxychorismate synthase, component I, clade 2 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.693571 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.438932 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACCAT TAGAAACCCA TAACTGGCAA CCCTGGTACT GGCGATCGCT CCCCCTTAAC 
CATCGCAGTG GATCGGAGAT CTTTGCCCGC CTGTTTCTGC CCCCCTCCCG TTCCCATGGG
ATTGCCACCC TATTGGAAAG TCCAGCGGAT TCGCCTTTAC CCCAGGCTCG TTATTCCATC
TGTGCCGGTG GTCCCCGTAC CGATCGCGGT TCTGCCCGCC TCTGGACGCC CCCCCTTGGC
CAAATTCTGC CCTTGCTCCG CCACCTTCTG GCCCGTCCCC TCGATCCCGA CTTACCGGAT
TTACCCTTTA CAGGGGGATG GTTGGGTTGG GTTGGCTATG ACCTGGGCTG GGAAATTGAG
CGGTTACCCT ATTTGCGCCA GGATTCCCTC CCCTTCCCTG TGGCTTACTG GTACGAACCC
GCTGCCTTTG CTGTCCTCGA TCACCATCAA CAGCAACTCT GGTTAGCTGG GAGTCGTCCA
GAACAGTTAG ACCAGTTGGA GCAGTCCCTC GATCGCCCCC CCCCTGAACC AGGCTTGGAG
CAGAGCAGAG GGGGCCTGCA CTCCCTGAAA TTTTGCTCCA CCCAGCCAGA CTATGAAGCC
GCTGTCCGCC GGGCCAAACA ATACATTCGG GCGGGGGACA TTTTTCAAGC CAATCTCTCC
CTCCGCTTTC AGGCTCAGGG GTACTGTGAT AGCTGGTCAC TTTACCGCAC CCTGCAGCAG
ATCAATCCCT CCCCCTTTGC GAGCTACTGG CAAACCCCCT GGGGAGCGGT GGTCAGTTCT
TCGCCCGAAC GGTTGGTTCA GGTCCAGGGA CGGCAGGTGC AAACCCGTCC GATCGCCGGT
ACACGGCCCC GGGGAGCTAA CCCAGAGCGA GATCAACGCC TGGCGGTGGA ATTGCTGGCC
AGCGGCAAAG ACAATGCTGA ACACATCATG TTGGTGGATC TGGAGCGCAA TGATCTGGGA
CGCATCTGCG ACTGGGGCAG CGTAGAAGTA AACGAATTTC TGCAGCTAGA AACCTATAGT
CATGTCATCC ATCTGGTTAG TAATATTGTG GGCAGACTTC AGCCGAATGC CGGAGCCATT
GAGGTGATTC GCTCCCTCTT TCCGGGGGGA ACCATTACCG GATGTCCTAA AGTTCGCTGT
ATGGAAATTA TTGAAGAACT CGAACCCGTT CGTCGCAATC TCTTTTACGG TTCCGCTGGC
TACCTGGATC AACGGGGAAA TCTGGATTTG AATATTTTGA TTCGCACCTT ATTGCTGAAC
CCTCTGCAAC CAGGCGATCC TAGAACGGAG CTACAGATTT GCGGCCAGGT GGGAGCTGGA
ATTGTTGCCG ACAGTCAGCC GGAGCTGGAA TGGCAGGAAT CACTCCAAAA AGCTCAGGCT
CAACTCCTCG CTCTAGAAGA ATTGAGTAAC CAGGGGAAAA GAGTTTTAAG TTTGAGTAAC
CAAAAGGAAT TATAA
 
Protein sequence
MEPLETHNWQ PWYWRSLPLN HRSGSEIFAR LFLPPSRSHG IATLLESPAD SPLPQARYSI 
CAGGPRTDRG SARLWTPPLG QILPLLRHLL ARPLDPDLPD LPFTGGWLGW VGYDLGWEIE
RLPYLRQDSL PFPVAYWYEP AAFAVLDHHQ QQLWLAGSRP EQLDQLEQSL DRPPPEPGLE
QSRGGLHSLK FCSTQPDYEA AVRRAKQYIR AGDIFQANLS LRFQAQGYCD SWSLYRTLQQ
INPSPFASYW QTPWGAVVSS SPERLVQVQG RQVQTRPIAG TRPRGANPER DQRLAVELLA
SGKDNAEHIM LVDLERNDLG RICDWGSVEV NEFLQLETYS HVIHLVSNIV GRLQPNAGAI
EVIRSLFPGG TITGCPKVRC MEIIEELEPV RRNLFYGSAG YLDQRGNLDL NILIRTLLLN
PLQPGDPRTE LQICGQVGAG IVADSQPELE WQESLQKAQA QLLALEELSN QGKRVLSLSN
QKEL