Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC7424_5752 |
Symbol | |
ID | 7112930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 7424 |
Kingdom | Bacteria |
Replicon accession | NC_011738 |
Strand | - |
Start bp | 147064 |
End bp | 150273 |
Gene Length | 3210 bp |
Protein Length | 1069 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643484038 |
Product | amino acid adenylation domain protein |
Protein accession | YP_002381047 |
Protein GI | 218442727 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATACTA ACACAATTGA AAATATTTAT GAACTCTCAG CCATGCAACA GGGAATACTG TTTCACAGCT TGTATGAAAC AGACTTCGAC TCTTACTGCT TTCAATTTTG TTATACTCTC TGTGGGAAAT TGAATGTTAG TGCCTTAGAG CAAGCTTGGC AACAAGTTAT AGAGCGACAT ACGATTTTAC GCACCTCTTT CCATTGGGAA GAATTAGAGA AACCCTTACA AGTCGTACAT CCACAAGTAT CAATCCCCCT AAAACAATAC GATTGGTCTC ATCTGTCTTC AGAGGAGCAA CAGAAACAGC GAGCAGAATT TCTTAAAGCA GACCATAAAC GGGGCTTTAA TTTTGCCGAA CCTCCTCTAA TGCGTTTAAC TTTAATTAAA CTTGCCCCAG AAACCTATCT ATTCGTTTGG AGTAAACATC ATTTAATACT TGATGGGTGG TCAACCGCCC GAGTTGTGAA AGAAGTTTTT GAGAGTTATG AAGCCCTCAT GCTAGGTCGA GATTGTCCCT TATCTCCCTC AAGCCCTTAC TCAAATTATA TTGATTGGGT ACAACAACAA GACATCTCCC AAGCCGAACA GTTTTGGCGG CGAAATCTAA AAGGATTTAC TGCCCCGACT TCTTTGAAAG CTAATCAGTC TCAAAGGGAT TCCAAAGCGA GTTACTATCG GGAGCGAATT GAACTTTCGT TGCAAACCAC AGAAGCTTTG CAGTCATTAG CAAAACAACA TCAGCTAACT TTAAATACCC TTGTACAGGG TGCTTATGCT CTATTGCTCA GTCGCTATAG TGGAGAGGAA GATGTTCTAT TCGGCTCGGT GGTTTCTGGT CGTCCCCCTG AATTAGAGGG AGTAGAATCA ATGGTCGGGT TATTTATTAA TACTCTTCCG GTTCGGGTGC AAGTCTTACC TGAAATGTCC CTTCTCGATT GGCTTAAACA AGTCCAAAGC CAGCAGTTAG AACTCCGTCA GTATCAGTAC AGTCCCCTTG TAAAGGTACA GGCATGGAGT GAAATTCCGT CAGGTGTGCG GATGTTTGAG AGTTTGGTCG TATTTGAAAA TTATCCCCTC GATAGATCCC TAGGCTCATC AGGAAAGAGC ATAGAGGTAA AAGAGGCTTA TTCTCTCCAA CAGCCGACTT ATCCTTTGGA AATTATCGCC ACCGTAGACC GCCAACTTTA CTTAATTGTT AACTGCGATG GCCGGTATTT CGACTCAGCC AGGGCTAGCA AAATATTAGC TCATTTAAAA ATGCTACTCA AAGGCATGGT TGCCAATCCT CATCAACGCC TTGGTTCATT ACCTTTACTA ACACAGACAG AGCAACAACA CCTGCAAGCT TGGAATAACA CTAAAACCCC CTATCCTCAA CAGTGTATTC ATCAGTTATT TGAACTTCAA GCGACTCAAA CCCCTCAATC TGTGGCGGTT ATTTTTCAAG ATCAGCAATT GACCTATCAA GAACTTAACG AACGAGCCAA TCAAGTTGCT AATTACCTAA AACACCTTGG GGTAGGAACT GAAGACCTCG TTGGAATTTA TCTATTTAGC TCAATAGAAA TGATAGTAGG ACTGCTTGGC ATCTTAAAAG CAGGTGGAAC TTATTTACCT TTAGATCCAA GCTATCCTCA ACAAAGACTC GCTTTAATGT TGGAAGATGC ACAAATTTCT CTGTTATTAA CTAACAACCA GCTACGGGAA CAAATACCCG AATTTACCGG CAAAACTATC TGTCTCGATG GAGATTGGTC GAAAATAACT GAGCAAAATA AAGAAAATCT CTTAACTCAG ACTACCCCCG ATAATTTAGC CTACGTCATG TATACTTCCG GCTCAACAGG CACTCCTAAA GGGGTGTGTA TTCCCCATCG TGGCGTTGTG CGGTTGGTAA AAAATAATCA TTACGCTAGT TTAAATTCAT CAGAAGTCTT TTTACAGTTT GCCTCGATTT CCTTTGATGC CGCTACCTTT GAGATTTGGG GCTGTTTGCT CAATGGTGCT AGGCTCGTTT TATTCCCCGA AAAAGAGTTT ACTTTGTCAT CCTTGGGGAA AGTTGTCCAA GATTATGAAG TAACAACTCT GTGGTTAACC GCCGGACTAT TTCACTTGAT GGTAGATCAA CAACTCGAAA GCTTGCGAGG GTTACGACAA CTTTTAGCCG GCGGAGATGT TCTTAATCCC AATCATGTTC GTAAATTTAT TAATCAATAC AAAGATTGTC GTCTAATTAA CGGCTACGGG CCGACAGAAA ATACTACTTT TACTTGTTGT TATTCGATAA CTGATGACAC TCAATGGGAA ACCTCTGTGC CGATTGGTTA TCCAATTGCT AATACTCAAG TTTATGTTTT AGATCGCTAT TTACGACCTG TTCCCATTGG CATTGCAGGG GAACTTTATA TAGGAGGAGA AGGATTAGCC CGTTCTTATT GGAATCGTCC TGACTTAACT CAAGAGCGAT TTATTGATAA TCCTTTTCAA CCCAAAACTA AACTTTATAA AACAGGAGAT TTAGTTTGTT ATCGGTCTGA TGGCACTCTA GAATTTTTGG GTCGTCTCGA CCAACAGGTG AAAATTAGAG GCTTTCGTAT CGAACTAGGA GAAATTGAAT CCACTTTATC CGAACATCCT GCGGTAGCTG AAGTAACACT GGCTCTGAAA GAGGATACTA AGGGGGAAAA ACGAATTGTT GCTTATGTGG TTTGTCATCG AGAAAAAGCC GTATCAGTGA AAGATTTACG AGATTTTTTG CAAAAAGTCT TGCCAGATTA TATGTTACCT TCAGTGTTTG TATTTTTGGA CAAACTGCCT CTAACCTCTA ACGGCAAAGT AGACCGTCAA GCTTTGCCTG TTCCTGATTT TACTCGTCCT GCTTTATCTG CCGAGGCCAC AGCACCACGT ACTCCCCAAG AAAAACAACT CGCCGAGATT TGGGCTAACG TCATGAATCT TGAACAAGTC GGTATTCATG ATAACTTTTT TGAATTAGGT GGACATTCTT TAGTCGCTAT TCAAATTATT TCCCGTATCC GCGAAGTTTT TGCGATAGAT TTAGGCTTAA ATAGTTTATT TGAAACTCCA ACAATTGCAC AATTAGCCCA AATTATTCAA ATTACTCAAA ATAACACGCC GATTTTAGAT AAAATTGTGC CCATTTCCCG TGACTCTTAT CGTCAACGAC GCTCTCAACT ACATAACTAA
|
Protein sequence | MNTNTIENIY ELSAMQQGIL FHSLYETDFD SYCFQFCYTL CGKLNVSALE QAWQQVIERH TILRTSFHWE ELEKPLQVVH PQVSIPLKQY DWSHLSSEEQ QKQRAEFLKA DHKRGFNFAE PPLMRLTLIK LAPETYLFVW SKHHLILDGW STARVVKEVF ESYEALMLGR DCPLSPSSPY SNYIDWVQQQ DISQAEQFWR RNLKGFTAPT SLKANQSQRD SKASYYRERI ELSLQTTEAL QSLAKQHQLT LNTLVQGAYA LLLSRYSGEE DVLFGSVVSG RPPELEGVES MVGLFINTLP VRVQVLPEMS LLDWLKQVQS QQLELRQYQY SPLVKVQAWS EIPSGVRMFE SLVVFENYPL DRSLGSSGKS IEVKEAYSLQ QPTYPLEIIA TVDRQLYLIV NCDGRYFDSA RASKILAHLK MLLKGMVANP HQRLGSLPLL TQTEQQHLQA WNNTKTPYPQ QCIHQLFELQ ATQTPQSVAV IFQDQQLTYQ ELNERANQVA NYLKHLGVGT EDLVGIYLFS SIEMIVGLLG ILKAGGTYLP LDPSYPQQRL ALMLEDAQIS LLLTNNQLRE QIPEFTGKTI CLDGDWSKIT EQNKENLLTQ TTPDNLAYVM YTSGSTGTPK GVCIPHRGVV RLVKNNHYAS LNSSEVFLQF ASISFDAATF EIWGCLLNGA RLVLFPEKEF TLSSLGKVVQ DYEVTTLWLT AGLFHLMVDQ QLESLRGLRQ LLAGGDVLNP NHVRKFINQY KDCRLINGYG PTENTTFTCC YSITDDTQWE TSVPIGYPIA NTQVYVLDRY LRPVPIGIAG ELYIGGEGLA RSYWNRPDLT QERFIDNPFQ PKTKLYKTGD LVCYRSDGTL EFLGRLDQQV KIRGFRIELG EIESTLSEHP AVAEVTLALK EDTKGEKRIV AYVVCHREKA VSVKDLRDFL QKVLPDYMLP SVFVFLDKLP LTSNGKVDRQ ALPVPDFTRP ALSAEATAPR TPQEKQLAEI WANVMNLEQV GIHDNFFELG GHSLVAIQII SRIREVFAID LGLNSLFETP TIAQLAQIIQ ITQNNTPILD KIVPISRDSY RQRRSQLHN
|
| |