Gene PCC7424_4552 details


Gene Information

Locus tag: PCC7424_4552
Symbol:
ID: 7108302
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 7424
Kingdom: Bacteria
Replicon accession: NC_011729
Strand:
Start bp: 5031083
End bp: 5032609
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 42%
IMG OID: 643482770
Product: anthranilate synthase component I
Protein accession: YP_002379783
Protein GI: 218441454
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
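
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 5031083
END_BP = 5032609


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(length_bp: int) -> int:
    """Number of residues for a CDS that ends with one stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1527
print(expected_protein_length(length_bp))  # 508
```

Both results agree with the Gene Length (1527 bp) and Protein Length (508 aa) fields listed in the record.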


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCTCGC CAACATTTGC TGAATTTGAA TCCTTAGCCA AACAAGGTAA CTTTATCCCA 
GTTTATCGAG AATGGATTGC TGATCTAGAA ACACCGGTGT CAGCTTGGTA TAAAGTCTGT
GCCGGAGAGG AATATAGTTT TCTTTTAGAA TCAGTAGAAG GAGGAGAAAC CATCGGGCGC
TACAGTTTTT TAGGCTGTGA ACCGATGTGG GTGTTGGAAG CGAGAGGAAA TACCACGACT
CAAACCTATC GCAATGGGAA CATTGAACGT TTTGAAGGGA ATCCGTTTGA AATTCTCTCT
AGTTGTATAG ACCCGATTAA ACCGGTCAAA TTACCCCAAC TTCCTCCAGG TATTGGCGGA
TTATTCGGTT ATTGGGGGTA TGAGTTAATT CGCTGGATAG AACCGCGAGT CCCTATCGGT
GAAGCTACGG AGAAGGATTT ACCCGATGGG ATTTGGATGC AAGTGGATAA TTTGATTATT
TTTGATCAAG TTAAACGGAA AATCTGGGCA ATTGCTTACG GGGATTTACG GGATGAAACC
GTGAGTTTAG AAGAAGCTTA TCAACAAGCC TGCGATCGCG TTACCAAGTT AGTGTTAAAA
TTACAGTTTC CTCTGCCGGC AGAAGGGAAA ACCCTGGAAT TATCGACAAA CAAAAACGAC
AATGCTCAAG AGTTAGAGTA TCAAAGTAAT ACCTCTAAAG AACAATTTTG TACTAATGTC
CTCAAAGCCA AAGATTATAT CCGTGCCGGA GACATTTTTC AGGTGGTTTT ATCCCAACGT
CTGAGCGCTC CTTATAAAGG GCATCCTTTC GATTTATACC GCTCTCTGCG ACTGATTAAC
CCTTCCCCAT ATATGGGTTT TTATCAATTC AAAGATTGGC AAATTATTGG GTCATCTCCT
GAAGTGATGG TCAAAGCGGA ACTCAATGAA AATAAAACCC TCAAAGCCAC ATTACGCCCC
ATTGCCGGCA CTAGACCTAG GGGAAAAACC CTTGCAGAAG ATTTAGCCTT TGAGAAAGAT
TTATTACAAG ATCCTAAAGA AATTGCCGAA CACATCATGT TAGTCGATTT AGGCCGCAAT
GATTTAGGAC GGGTGTGTAT GAAGGGAACG GTTAAAGTCG ATCAATTAAT GGTGATTGAG
CGCTATTCCC ATGTGATGCA CATCGTTAGT AATGTAGTGG GAGAATTAGC CCCCCATAAA
ACAGCTTGGG ATTTATTAAA AGCCTGTTTT CCCGCCGGAA CCGTTAGTGG CGCTCCCAAA
ATACGAGCGA TGGAAATTAT TCAAGAATTA GAACCGGAAC GTAGAGGCCC CTATTCTGGA
GTTTATGGAT ATTACGATTT TGAAGGTCAG CTTAACAGTG CCATCACCAT TCGGACAATG
ATAGTTCGTC CTGTGAGTGC CAATCAATAC ATTGTTTCTG TTCAAGCGGG AGCCGGATTA
GTGGCTGATT CAATTCCCGA AAAAGAATAC GAGGAAACAC TGAATAAAGC GAGGGGTTTA
TTAGAAGCAA TTCGTTCTTT AAAGTGA
 
Protein sequence
MISPTFAEFE SLAKQGNFIP VYREWIADLE TPVSAWYKVC AGEEYSFLLE SVEGGETIGR 
YSFLGCEPMW VLEARGNTTT QTYRNGNIER FEGNPFEILS SCIDPIKPVK LPQLPPGIGG
LFGYWGYELI RWIEPRVPIG EATEKDLPDG IWMQVDNLII FDQVKRKIWA IAYGDLRDET
VSLEEAYQQA CDRVTKLVLK LQFPLPAEGK TLELSTNKND NAQELEYQSN TSKEQFCTNV
LKAKDYIRAG DIFQVVLSQR LSAPYKGHPF DLYRSLRLIN PSPYMGFYQF KDWQIIGSSP
EVMVKAELNE NKTLKATLRP IAGTRPRGKT LAEDLAFEKD LLQDPKEIAE HIMLVDLGRN
DLGRVCMKGT VKVDQLMVIE RYSHVMHIVS NVVGELAPHK TAWDLLKACF PAGTVSGAPK
IRAMEIIQEL EPERRGPYSG VYGYYDFEGQ LNSAITIRTM IVRPVSANQY IVSVQAGAGL
VADSIPEKEY EETLNKARGL LEAIRSLK
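
The sequences above are printed in blocks of 10 characters, so the whitespace has to be removed before any analysis. The sketch below shows one way to do that, compute a GC fraction, and translate the DNA. It is illustrative only: the helper names are hypothetical, only the first line of the gene sequence is used, and the codon table is written out by hand. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG).

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def strip_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in an unambiguous DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "ATGATCTCGC CAACATTTGC TGAATTTGAA TCCTTAGCCA AACAAGGTAA CTTTATCCCA"
dna = strip_blocks(first_line)
print(len(dna))        # 60
print(translate(dna))  # MISPTFAEFESLAKQGNFIP
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_content` would give the fraction to compare against the GC content field; that value is not recomputed here.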