Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC7424_5117 |
Symbol | |
ID | 7108016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 7424 |
Kingdom | Bacteria |
Replicon accession | NC_011729 |
Strand | + |
Start bp | 5681443 |
End bp | 5684460 |
Gene Length | 3018 bp |
Protein Length | 1005 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 643483327 |
Product | Ig domain protein group 1 domain protein |
Protein accession | YP_002380336 |
Protein GI | 218442007 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.0644734 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTACAA TTTTTACGAC TCAAACTCCG TCCAACCCTG AATGGACTGA CAATGTTGCT TACGAATTAG GAATGAAATT CAGCAGCACA GAAACTGGAC AAATTACCGC TATTCGTTAC TGGAGAGCTA ACAGCGAAAC CGGAACTCAT ACGGGTAAAA TTTGGACAGC AACCGGGGAG CTTTTGGCCA GTGTTACTTT TAGCAATGAA ACCTCCTCTG GCTGGCAGGA ACAGGTTCTC AGCACACCCC TCAACATCCA AGCTAACACG ACTTATGTGG TCTCCGTTAA CTGCAATCTT TATTACGTTT ATGCTTATGA TGAACTCGCT AATCCAATTA CTAACGGAGA ACTTAGCTCA ATAGCAGATG GGAACAATGG GGTGTTTAAT GGAACTCCAG GTGCTTTTCC GGCCAATTCT TATCGAAATA CTAACTATTT TCGTGATGTC AATTTTGTTA CGGTTGCTCT GCCGACAATT ACTAAGGTTG GTGGAGATAA TCAGACGGGT GCAGCAGGAA CTGCTTTATC GACTCCCTTA GTGGTTGAAG TTAAAGATGG TTCTGGTAAT CTTTTGTCAG GGCAAACGGT GAACTTTGCC GCCACCACCG GAGGAGGTTC AGTTTCTCCG GCCAGTGCCG TAACCGACGC TAACGGACAG GCTCAGACAA CCTTAACTTT AGGATTGATC CCTGGTGCGA TCGCAAATGT AACGAATACG GTAGAAGCTA CGGCGGACGG CATCGGAAGC GTTACTTTTA CCTGCGTTGC TAGTCACTCA ACCGATAATC TCACTGTTTT GACCACTCAA ACTCCAGTAG ATGGCAATAT AACCGATGGA GTTTCTTACG AATTAGGCAT GAAATTCCGT AGTGCCAGTG GGGGGCAGAT TATAGGAATT CGCTATTGGA AAGCACCCAG CGAAACCGGA ACTCATACCG GTAAAATCTG GTCAGCAACA GGAACTCTTT TAGCGAGCGT TAGTTTTACG AATGAGACGG CCTCTGGTTG GCAATATCAA GCCTTAGAAA CCCCCTTAAA CATTCAAGCC AATACCATCT ACGTAGTTTC TGTTAACGGA AATAGTTATT ATGTTGCGAC TAACAATGGA CTGGCTAATT CGATTATTAA TGGGGATCTC AGTTCGGTAG CAGATAACAA CAATGGGGTA TTTAATTTCA ATGCTAATTC TTTTCCCACC AGTTCTTGGT TTAACAGTAA CTATTTCACA GACATCGTCT TCGTTGTCGG TAGTCGTTTA GTGAAAGTCT CTGGAGACAA TCAGAGTGGG GCGACAGGGG CTACTCTGCC TAATCCCCTA GTGGTGCAAG TCCTCGATGC ACAAAACAAT CCTGTATCAG GGCTAACCGT CAATTTTGCA ATTACCAGTG GGGGGGGATC ACTTTCAGCC AGTAGCGTAG TAACTCAGAA CGGTCAAGCT AGCACAAATT TAACTTTGGG AGCAGTCCCC ACCGCACCTG GAGGCGTAGT GGTAATAGCG ACAGTAGACG GCATAGGTTC TACTTATTTT ACTGCTACGG CAACTATTAG TAATCCGAAT GCGATTTATT TAGAAAACCT AAACCCTGGC ACAACGGCTT GGAAACTGGT CAACCGAGGT AGTGATGAGA TCGCCGGCTA TGCTTCAGCC ACCAGTATCA ATAAAGGACA ATCTATTGAT TTTAAAGTTT CTCTAGGCCA AGCTGGACAA TTTACCATTG ATGTGTACCG CTTAGGTTAC TATGGAGGCG CAGGAGGTCG GCTGATGGCC AGCAGTGGCT CACTCAACGG CACGACTCAA GCCCCTGGTG TTATCGATCC CAATACTCGT TTAATTGAAT GTAACTGGAC AACCTCTTAT ACGTTACAGA CCGGCAATGA CTGGACTAGC GGGCTTTATG TGGCTAAATT AACCGATCAA GCTAGTGGCA AAATAGCCCA TATCTGGTTT GTGGTTCGGG ATGATAGCAG CACTGGGAAA GTTTTATTTC AAAGCAGTGT CTCTACTGTA TTAGCTTATA GCACAATGGG CGGATACAGC TTATACACAA TGAACAGTAT TAATGGGCAG CGAGCTTATA AAGTGTCCTA CGATCGGCCT TTTTCTCAAG CCACTTATCA AGAATCCTAC GAAGCTGACA CGATGCTGCG ATGGGAGTAC AACATGGTGC GCTGGCTAGA ATCTCAAGCC TACGATGTGA CTTACGTGAG TAACATGGAT GTTCACACCA ACCCAAATCT GTTGCTCAAT CATCAAGTAT TTCTATCCGT TGGTCACGAT GAATACTGGT CGAAGGAAAT GCGAGATAAT GTGGAAGCCG CCCGAAATGC GGGAATTAAT CTCGCCTTTT TCTCTGCAAA TACCTGTTAT TGGCGAGTGA GATTTGAAGA TTCTACCCTC AATGCTGGAC AAGTTAGACC GAATCGAGTC ATGGCTTGTT ATAAATCAGA TTGGGATCTA GATCCTGTGG CTATTCAACA AGGGCCAAGT GCAGCCACCA ATAAATTCCG CAGTTTCCAA AATCAACGCC CAGAAAATGC CCTTTTAGGG GTGATGTATG GTAGCGATAC CCCTAATATT TATGGTGGAT TTAATATGAT CATCACCAAT AGTACAGATC CTTATTATGC CAACACGGGG CTATCGAATG GGGATCAACT CACCTTATTA GTGGGTTATG AATGGGATTT TGTGGTTAAT AATGGGTCTA GCCCTCCTGG GTTGGTCATT CTCTCTCAAT CAGGCGTTCA ACCTGCTGCT CTTTTACCGA ACTACGACGA ACCCCCAAAT GAACCCGGCC TACCCCAAAA TCAGAACTTT AACATTGCTA ATTCGGTGCG TTATACCGCC AGTGCAAAAG TTTTTGCCAG TGGAACGATT CAATGGGCGT GGGGGCTAGA TAGTGATGAT GTTAGCCCGG CGAGGGAAGA TGTGCGGGTT AAACAAATAA CCGTCAATAT TCTCGCCGAT ATGGGAGCTA CACCCCAAAG CCCCGATCCC AACATTATTG TCCCTTAA
|
Protein sequence | MTTIFTTQTP SNPEWTDNVA YELGMKFSST ETGQITAIRY WRANSETGTH TGKIWTATGE LLASVTFSNE TSSGWQEQVL STPLNIQANT TYVVSVNCNL YYVYAYDELA NPITNGELSS IADGNNGVFN GTPGAFPANS YRNTNYFRDV NFVTVALPTI TKVGGDNQTG AAGTALSTPL VVEVKDGSGN LLSGQTVNFA ATTGGGSVSP ASAVTDANGQ AQTTLTLGLI PGAIANVTNT VEATADGIGS VTFTCVASHS TDNLTVLTTQ TPVDGNITDG VSYELGMKFR SASGGQIIGI RYWKAPSETG THTGKIWSAT GTLLASVSFT NETASGWQYQ ALETPLNIQA NTIYVVSVNG NSYYVATNNG LANSIINGDL SSVADNNNGV FNFNANSFPT SSWFNSNYFT DIVFVVGSRL VKVSGDNQSG ATGATLPNPL VVQVLDAQNN PVSGLTVNFA ITSGGGSLSA SSVVTQNGQA STNLTLGAVP TAPGGVVVIA TVDGIGSTYF TATATISNPN AIYLENLNPG TTAWKLVNRG SDEIAGYASA TSINKGQSID FKVSLGQAGQ FTIDVYRLGY YGGAGGRLMA SSGSLNGTTQ APGVIDPNTR LIECNWTTSY TLQTGNDWTS GLYVAKLTDQ ASGKIAHIWF VVRDDSSTGK VLFQSSVSTV LAYSTMGGYS LYTMNSINGQ RAYKVSYDRP FSQATYQESY EADTMLRWEY NMVRWLESQA YDVTYVSNMD VHTNPNLLLN HQVFLSVGHD EYWSKEMRDN VEAARNAGIN LAFFSANTCY WRVRFEDSTL NAGQVRPNRV MACYKSDWDL DPVAIQQGPS AATNKFRSFQ NQRPENALLG VMYGSDTPNI YGGFNMIITN STDPYYANTG LSNGDQLTLL VGYEWDFVVN NGSSPPGLVI LSQSGVQPAA LLPNYDEPPN EPGLPQNQNF NIANSVRYTA SAKVFASGTI QWAWGLDSDD VSPAREDVRV KQITVNILAD MGATPQSPDP NIIVP
|
| |