Gene Information |
Locus tag | P9303_03271 |
Symbol | |
ID | 4776142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 337028 |
End bp | 340582 |
Gene Length | 3555 bp |
Protein Length | 1184 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640085829 |
Product | hypothetical protein |
Protein accession | YP_001016344 |
Protein GI | 124022037 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG1196] Chromosome segregation ATPases |
TIGRFAM ID | [TIGR02169] chromosome segregation protein SMC, primarily archaeal type |
|
|
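The coordinate and length fields above agree with one another, and a short sketch can confirm this. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. Both assumptions are inferred from the numbers, not stated in the record, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields.
start_bp = 337028
end_bp = 340582
reported_gene_length = 3555      # bp
reported_protein_length = 1184   # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print("gene length matches:", gene_length == reported_gene_length)
print("whole number of codons:", remainder == 0)
print("protein length matches:", protein_length == reported_protein_length)
```

Under these assumptions the span from start to end gives the reported gene length, and the codon count minus the stop codon gives the reported protein length.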
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATCC CCCTCGAAGA GGGCTTCACA GTCGTCACAG GCCCCAACGG TTCGGGCAAG AGCAACATCC TCGACGGCGT TCTCTTCTGC CTTGGTCTTG CCACAAGCAG GGGTATGCGA GCAGATCGAC TGCCTGATCT TGTCAACAGC CGAATCCTTC GAGCAGGCAA GGCCGCTGAA ACCGTGGTGA GTGTGCGCTT CGACCTCAGC GACTGGCAGC CCGATCCAGC CGAGGCAGGT CTGGAGCCAC CAGAGGAAGG TCCCTGGATC AAAGCTGATC AAAAAGAGTG GACCGTCACC CGCCGCCTGC GGGTCATGCC TGGCGGCTCC TACAGCACGA GCTACAGCGC CGATGGCGAA CCCTGCAACC TGCAGCAACT TCAAACCCAG CTGCGCCGAC TGAGGATCGA CCCAGAAGGC AGCAATGTTG TGATGCAGGG CGATGTGACC CGCATCGTCT CGATGAGTAA CCGCGATCGC CGCGGCCTGA TCGACGAACT GGCTGGTGTA GCACTCTTCG ACAGCCGCAT CGAACAGACT CGCGCCAAGC TCGATGATGT TCAAGAGCGG CAGGAACGCT GCCGGATCGT GGAACAGGAA CTGCTGACCA CACGGCAACG CCTAGAACGA GACTGCGCCA AAGCCCGCAC CTACCAGGAA CTCCGCCAGC AACTTCAATT GGGTCGGCAA CAGGAACTGA TGCTGGCCTT TGAAGCAGCC CAGCAAGGTT TGCGTGATCT TCAAACCCGA CACCAACAAC TTGGCGAGCA AGAGGTACAC GACGCTGCGA ACCTCAAAGA GGCCGAAGAA AAGCTGGCCA AGGCCGCCGC CAACCTGAAG ACTCTGCAAG AGAACGTGAA GGCCCTCGGC GAAGACCAGT TGCTAGCGGT GCAAGCCGAG CTGGCGGGAC TGGAGACCCA GGCTCGTGAA CTCGAGCGCC AAGCAGAGCA ACACCAAAAT GAAGGTCAGC GCCTCCAAGG CGTTCGCCAA GACCTAAGCA ACCGCCGCAA ACAACTGCAA CAGGAGGCTC ACAGTCAGAC CGAAGATCCC CATCGCACGG CCTTAGAAGA TGCTGAGAAG ACCTGCAGAG ATGCCGAAGC CGCTGTAGAG GTCTCCCGTC GCCGCCTTGG AGACGTCGCC GGTCGTTCTG GTGCTTGGCT TGAGCAACAA CGGCAGCGCA GCTCTCGCCG TCAGGAACTT CAATCCACCC TTACACCACT GCAGCAGGAG CAACAACAAC TTCAAGAACG CCTGCGACAA GACGGAGAGC GGCGAGTCGA ACTCGAAGCA GAGCAGCAGC GTGACGGCAC CGAAGACCAG CAGGTACAGA AACAACTGGA TCAACTCGAA CAGGAATGGC AGGCACTCCT CCAGAACATC AGCGACAAGA AAGAACAGGT TCAGCAAGCC GCTGAATCCC TGGCAGTGCA GCAACGTACC CGCAGTCGAC TAGAGCAAGA ACAAACCCGC TTGGAAAAAG AGATCGCCCG GCTAGAGAGT CGTCGCGAGA CACTGCAGGA GAGTCGTGGC ACTGGAGCTC TAAGGCTGCT ACTTGAAGCC GGTCTGGAAG GTATCCACGG GCCCGTTGCC CAGCTTGGGG AGGTAGACGA TCGCCACCGA CTTGCTTTGG AAGTCGCTGC GGGTGCACGA CTCGCTCAAG TCGTCGTAGA CGACGATCGC ATTGCCGCCA AAGCCATCGA ACTGCTCAAA AGCAGGCGAG CAGGCCGGCT CACCTTCTTG CCTCTCAATC GCATCAAAGC ACCTGCAGCT AGCAGCAATA GCGCCCTCAT GCGAGGGCGA 
AAACCTGACA ACGCTGATAG CAGCACTGGC CTGATCGACA AAGCCTTTGA GCTGGTTCGT TTTGAGCCCA TTTATGCCGA GGTTTTTGCC TACGTCTTCG GCGAAACTTT GGTCTTTAGC GACCTGAAAT CTGCCCGGCT GCAGTTGGGA CGAACTCGAG CCGTAACTCT TGATGGAGAA CTGCTAGAGA AGAGCGGTGC CATGACAGGC GGCAGTTTTT CCGGTCGTAG CAACAGCCTT AGCTTTGGTA GCAGCAGCGA GGGAGATGAG GCCGAACCCC TACGCCGCAG GCTTCTTGAA CTTGGAGAAA CCTTGGTGGC TTGCCGCAGA GAGGAAGCTC TGCTCAGCCA AGTCCTCGAG GAAGCTCGCC CCTGCCTCAG GAATCTTGAA CAGCGCCAAG CCGCTCTGGA AGCCGAACGC ACTGCAGCCC GCCGCTCCCA TGGCCCCTTG ATGGAAAGAC GTCATCAACG CTCTCAAAAA GTGGAGGGGC TACAAGCTCA TCAAGAACAA CAGCAGCAGC GACTGAACGC TCTCATCGAA AAGCTGTCTC CCCTCACCCT TGAACTGCAG CAACTTGAAC AACATGAGCA GGAAGCCCAG GCGGATGGCG ATGCAGAGAC CTGGCAACGA TTACAAGCGG ATCTTGAAAC CGCCGATGAC GCACTGGGGA CAGCCAGAAC GAATCGGGAT CAGTTGCGGA CTGCCCAGCA GCATCGCCAC CTCGCCCTCG AACGCCTAGG CGATCAACAA AAGGGCCTGG AGGCTGAGGA AAAAAGACTG CAGGAAGCTG TCCAGGCGCT GGCAACTGCC CATGCTCAAT GGCGTGACCA GCAGCAAGAA CTGCAAGCGC GAAGACAAAC ACTAGAGAGC CAACAACAAG ACCTGCAGAC TCGCTTTGGC GAACAACGGC GCGCCAGGGA TGCGGCTGAA GCAGAAGTAG CAAACCAGCG CCAAGGCCTA CAAGAGGCCC AGTGGAACCT GGAACGCCTC CGACAGGATC GCCAGGCCCT TGCCGAAGAG CTTCGCAGCG GCGGCCTCCG ACTCAAAGAA CTCCAACAAG CCTTGCCAGA CCCTCCCCCG GAGATCCCCG CAGAGCTTCG CAGCGCAGGA CTTGAGGCCC TTCAAGCCGA TCTGCAGCAG ATTCAAAGCC GTATGGAAGC CCTGGAACCA GTGAACATGC TGGCCCTTGA AGAACTCGAA CAACTCGAAC AACGACTCGG AGATCTGGTC GAAAGGCTTG AGGTTCTCTC CCAGGAGCGA GAAGAACTAC TGCTCCGCAT CGAGACCGTC GCCACCCTTC GTCAGGAAGC CTTCATGGAA GCTTTTGAAG CAGTTGATGG TCATTTCCGT GACATCTTCG CCAGCCTCTC TGAGGGCGAT GGCCACCTAC AACTCGACAA CCCTGACGAC CCCCTCGAAG GCGGCCTCAC CCTGGTAGCA CATCCGAAAG GCAAAGCAGT GAGACGCCTT GCGGCTATGT CTGGTGGAGA AAAATCGCTT ACGGCACTCA GCTTTCTATT TGCTCTGCAG CGCTTCCGCC CCTCTCCCTT CTATGCCCTC GATGAAGTGG ACAGTTTCCT CGATGGGGTG AATGTGGAAC GCCTTGCTGC TCTGATCGCC CGCCAAGCAG AACAAGCCCA GTTCATGGTT GTGAGCCACC GCCGACCAAT GATCGGCGCC TCCAACCGCA CCATCGGGGT AACCCAGGCC CGTGGGGCTC ACACTCAGGT CGTGGGATTA CCCAATGCGG CGTGA
|
Protein sequence | MTIPLEEGFT VVTGPNGSGK SNILDGVLFC LGLATSRGMR ADRLPDLVNS RILRAGKAAE TVVSVRFDLS DWQPDPAEAG LEPPEEGPWI KADQKEWTVT RRLRVMPGGS YSTSYSADGE PCNLQQLQTQ LRRLRIDPEG SNVVMQGDVT RIVSMSNRDR RGLIDELAGV ALFDSRIEQT RAKLDDVQER QERCRIVEQE LLTTRQRLER DCAKARTYQE LRQQLQLGRQ QELMLAFEAA QQGLRDLQTR HQQLGEQEVH DAANLKEAEE KLAKAAANLK TLQENVKALG EDQLLAVQAE LAGLETQARE LERQAEQHQN EGQRLQGVRQ DLSNRRKQLQ QEAHSQTEDP HRTALEDAEK TCRDAEAAVE VSRRRLGDVA GRSGAWLEQQ RQRSSRRQEL QSTLTPLQQE QQQLQERLRQ DGERRVELEA EQQRDGTEDQ QVQKQLDQLE QEWQALLQNI SDKKEQVQQA AESLAVQQRT RSRLEQEQTR LEKEIARLES RRETLQESRG TGALRLLLEA GLEGIHGPVA QLGEVDDRHR LALEVAAGAR LAQVVVDDDR IAAKAIELLK SRRAGRLTFL PLNRIKAPAA SSNSALMRGR KPDNADSSTG LIDKAFELVR FEPIYAEVFA YVFGETLVFS DLKSARLQLG RTRAVTLDGE LLEKSGAMTG GSFSGRSNSL SFGSSSEGDE AEPLRRRLLE LGETLVACRR EEALLSQVLE EARPCLRNLE QRQAALEAER TAARRSHGPL MERRHQRSQK VEGLQAHQEQ QQQRLNALIE KLSPLTLELQ QLEQHEQEAQ ADGDAETWQR LQADLETADD ALGTARTNRD QLRTAQQHRH LALERLGDQQ KGLEAEEKRL QEAVQALATA HAQWRDQQQE LQARRQTLES QQQDLQTRFG EQRRARDAAE AEVANQRQGL QEAQWNLERL RQDRQALAEE LRSGGLRLKE LQQALPDPPP EIPAELRSAG LEALQADLQQ IQSRMEALEP VNMLALEELE QLEQRLGDLV ERLEVLSQER EELLLRIETV ATLRQEAFME AFEAVDGHFR DIFASLSEGD GHLQLDNPDD PLEGGLTLVA HPKGKAVRRL AAMSGGEKSL TALSFLFALQ RFRPSPFYAL DEVDSFLDGV NVERLAALIA RQAEQAQFMV VSHRRPMIGA SNRTIGVTQA RGAHTQVVGL PNAA
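The gene sequence is displayed in space-separated blocks of ten bases, starting with ATG and ending with TGA. The protein sequence starts with MTIPLEEGFT and ends with PNAA. The following is a minimal sketch of how the first and last blocks map onto those residues. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code. The `translate` helper and the compact table layout are illustrative, not the database's own method.

```python
from itertools import product

# Compact codon table in T, C, A, G order (first base varies slowest).
# Assumption: table 11 uses these standard codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon.

    Whitespace (such as the spaces between 10-base blocks) is ignored,
    and any trailing bases that do not form a full codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three and last two blocks, copied from the Gene sequence field.
first_blocks = "ATGACCATCC CCCTCGAAGA GGGCTTCACA"
last_blocks = "CCCAATGCGG CGTGA"

print(translate(first_blocks))
print(translate(last_blocks))
```

The same helper can be applied to the full gene sequence after pasting it in as one string. The result, minus the trailing `*`, can then be compared with the Protein sequence field once its spaces are removed.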