Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC7424_4571 |
Symbol | |
ID | 7108321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 7424 |
Kingdom | Bacteria |
Replicon accession | NC_011729 |
Strand | - |
Start bp | 5055076 |
End bp | 5056482 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643482788 |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_002379801 |
Protein GI | 218441472 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component |
TIGRFAM ID | [TIGR00996] virulence factor Mce family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAAGTG GGACAGGGAA TGGAAGATTA TCGCCTTTCA TGCTTCAAAG CGCTGTAGGG TTGACAATTT TAGTGGCTTT AGGGTTATTA GGCTGGCTGA TTTTGTGGCT GAGTAATTTT AGTTTCGGAA ATCGCAGTTA TCGAGCAACC TTTATTTTTC CCAATGCTGG AGGGATGAGT GTAGGAACGC GAGTCGCTTA CCGAGGGGTG AGAGTGGGGC GGATTGTTTC GATTAATCCT GAACCGGGAG GGGTGGCCAT TGGGGTAGAA ATTTCACCCG CCGATCGCTT GATTCCTGCC AACTCATCCA TCGAAGCAGT ACAAGCGGGT TTGGTAGGGG AAACCTCTAT CGATATTATT CCTGTTGAAG GATTGCCCCC TGAAGGCGTT AAAGCTGACC CCCTCGATCC TAAGTGTAAT CGGGCGATTA TTATTTGTAA TGGGTCAAGA TTACAAGGGG AGGGGAAATT AGATGTTAAT GCTTTAATTC GCTCTCTTTT ACGTATTGCT GACGTGATCG ACGATCCCGA ATTTACGGGG ACTGTTAGAG TCATTGCCCA AAGAACAACA GATGCACTCG GTAATATTAG TAGCTTTAGC AAAGAGGCAT CGGGGTTAAT TAAAGATACC CGAAGCAACA GAACCATTAA CAGACTTGAT AATACCTTGC TTTCTATCGA TCAGGCCGCC GATAGTGTTA ATCAAGCCGC CAGTCAGATC GATCAGGTAA CAGGTGATAT CAATCAGGCC GCCGGTAGTC TTCAACGCTT AGGAAATCAA GCCTCTAGTT TATTAGAAGA AGCGGAACGG CAAGATACGC TTAAAAATTT AAATTCTACT TTAGTGTCTC TACAAGGATT TTCTGAACAA ATTCGCGGAT TTATTACCGT TAATCAGGGG AATATTGGTA ACACTTTAGT TGGACTGGGA GAAACTAGCC AAGAATTAGC CGCGACTTTA CGCAAGTTAG GCCCGATTTT AACTCAAGTC GAACAAAGTA AATTGGTGAC TAATTTGGAT ACTATATCCA ATAATACTGC TGCCTTAACC GGGAATTTAC GGGATATTTC CGCTAAACTT AATGATCCAG CCACAATTGT ACAATTACAA CAAATTCTTG ATGCTGCCCG TGCCGTTTTT GAAAATGCCA ATAAAATTAC TTCAGATCTC GATGAATTAA CAGGAAATCC CCAGTTTCGG AGAGATCTTA GGAGGGTAAT TGAAGGATTA AGTAATCTTG TTTCTTCTAG CGAACAACTG CAACAACAAT TAGAATATGC TCAAGTTTTA AACCGGGTTG AGGCTGAAGT TAATCGAATT CAGTCTAGGG AAAATATCTC TTTAAAGCCT AAAAATGTAA CTCCTACTCC TATTAAACCG AAAAATATTA CCCCATCTCA ACCTTAA
|
Protein sequence | MRSGTGNGRL SPFMLQSAVG LTILVALGLL GWLILWLSNF SFGNRSYRAT FIFPNAGGMS VGTRVAYRGV RVGRIVSINP EPGGVAIGVE ISPADRLIPA NSSIEAVQAG LVGETSIDII PVEGLPPEGV KADPLDPKCN RAIIICNGSR LQGEGKLDVN ALIRSLLRIA DVIDDPEFTG TVRVIAQRTT DALGNISSFS KEASGLIKDT RSNRTINRLD NTLLSIDQAA DSVNQAASQI DQVTGDINQA AGSLQRLGNQ ASSLLEEAER QDTLKNLNST LVSLQGFSEQ IRGFITVNQG NIGNTLVGLG ETSQELAATL RKLGPILTQV EQSKLVTNLD TISNNTAALT GNLRDISAKL NDPATIVQLQ QILDAARAVF ENANKITSDL DELTGNPQFR RDLRRVIEGL SNLVSSSEQL QQQLEYAQVL NRVEAEVNRI QSRENISLKP KNVTPTPIKP KNITPSQP
|
| |