Gene Information |
Locus tag | P9303_00191 |
Symbol | |
ID | 4777742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 23070 |
End bp | 24986 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640085518 |
Product | general secretion pathway protein E |
Protein accession | YP_001016041 |
Protein GI | 124021734 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | |
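The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function name `check_lengths` is illustrative and not part of the record.

```python
def check_lengths(start_bp, end_bp):
    """Derive gene and protein lengths from inclusive replicon coordinates."""
    gene_length = end_bp - start_bp + 1   # inclusive range, so add 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1           # assumption: the final codon is a stop
    return gene_length, protein_length


# Values taken from the record for P9303_00191
gene_length, protein_length = check_lengths(23070, 24986)
print(gene_length, protein_length)  # 1917 638
```

Both results agree with the Gene Length (1917 bp) and Protein Length (638 aa) fields.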
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGACCG TTTGCAAATT CACGTCTAGA CATGTTGATG GGAGTAGCAC ACCTCCCACA AGCGGCGGAA GCCCGCACCT CCAATTTCAT TTCTTCTTCG TGACCCAGGG CCGACCAATC CCCAAAGCAA CCAACAGCAT CCAGCAGCGA CTGGAACTGG AACTGCTCCT ACAAGTCAGT GTGCTTAGCC AAGAAGAACT TGTTGTCGGA GTGGAGCTAA TGGCCAACCA CACCACTCTT GACATCTCAA CGTGGCAACA ATTCCAGGCC CTGCCAATCA ATATGCACAA CCAACATTTG GTTGTAGCAA TCTCGGACCA ATGCAATGAG CAAACCAAAA ATCAACTGAT CTCATTGCTG CAGTCGCAAG GCTTCAGCAC AGAATTACGT CTCGCGCTGG CATCCGACAT CAGCCAGCTG CTTGCACCAA TGAGATCTGA GCAGCACGGT GAATCTGCAT CCAAGTCAAA AGCAACAAAG CCACTTGCAC AAACACCAAC CTCCCTGCTG GCAGGATTTA GTGCCGAAGG CGTGCTCGAA GAAGATCCTG AAGAACAAGC CAGACTTGCT AGTTCCGTAG AGGATTTGGA GTCCAGCTTG ATGGACTCAG ACAGCTCACC AGTCATCAAC CTGGTGGACC GCATACTGCT AGAGGCACTG CAAACAGAAG CCAGCGACGT ACACGTTGAG CCGCAACAAG ACGGACTACA GATCCGCTTC CGTCAAGACG GTGTTTTGCA GCGCTATATC GAACCCCTCC CGAGCCGGTT GATCCCTGCT GTTACCTCAC GCTTCAAGAT CATGGCCGAC CTAGACATCG CAGAACGGCG CATGGCTCAG GACGGTCGCA TTCGTCGCAC ATATCGCAAT CGTATGGTCG ATTTCAGGGT TAATAGCCTG CCAAGCCGTT ACGGGGAAAA AATCTGTCTG CGACTCCTCG ACAGCAGTGC TCCACAACTC GGGCTAGACA AACTGATCAG CAACCCAAGT GCTCTCTCTC TCGTACGCAA CTTGGGCTCT AAGCCCTTCG GCATGATCCT TGTGACAGGT CCAACAGGAT CAGGCAAGTC AACAACCCTT TACTCCCTAC TGGCAGAACG CAACGAGCCT GGTATCAACA TTTCCACCGT AGAAGATCCC ATCGAATACA CCCTCCCTGG GATTACCCAA TGTCAAGTGA ACAGGGAAAA AGGCTTCGAT TTCAGTACTG CACTGAGAGC CTTCATGCGT CAAGACCCAG ACGTTCTACT GGTCGGTGAA ACCCGTGACC TAGAAACCGC AAAAACCGCC ATCGAAGCCG CACTGACTGG TCATCTGGTG CTAACCACAC TGCATTGCAA TGACGCATCA AGTGCGATCG CCCGCCTCAA CGAAATGGGC GTGGAACCAT TCATGGTGAG CGCCTCGTTG ATTGGCATCG TCTCTCAAAG ACTTCTACGA AGAGTGTGCC GTTCTTGTCG TAAGTCCTAT CACCCAACTG AAAAAGAACT GGGGCGATTC GGATTGATGG CCCATACAGA AACTGGCGTG ACCTTTTTCA AGGCCCACCA TCACGGTCAA GAAAAACAGC CGTGCCCCAA CTGTCAAGGC AGCGGCTACA AGGGCCGAGT TGGTGTCTAT GAAGTACTGC GCATGAATGA AGAGCTCGCT ACAGCCGTCG CCAAAGGTGC CACTACTGAT TTAGTAAGAC GTCTGGCGCT CGAGGCTGGT ATGAAAACTC TGTTGGGCTA CAGCCTTGAC CTGGTGCGAG AAGGCCACAC CACCCTTGAG GAAGTGGGCC GAATGATCCT CACCGATTCC 
GGATTGGAAT CAGAGCGCCG CGCCAGAGCG CTTAGCACTC TCACCTGCAA CGTCTGCGGG GCAGGCCTCC AAGACGGCTG GCTGGAATGT CCCTACTGTC TAACCGCTCG CCAATGA
Protein sequence | MLTVCKFTSR HVDGSSTPPT SGGSPHLQFH FFFVTQGRPI PKATNSIQQR LELELLLQVS VLSQEELVVG VELMANHTTL DISTWQQFQA LPINMHNQHL VVAISDQCNE QTKNQLISLL QSQGFSTELR LALASDISQL LAPMRSEQHG ESASKSKATK PLAQTPTSLL AGFSAEGVLE EDPEEQARLA SSVEDLESSL MDSDSSPVIN LVDRILLEAL QTEASDVHVE PQQDGLQIRF RQDGVLQRYI EPLPSRLIPA VTSRFKIMAD LDIAERRMAQ DGRIRRTYRN RMVDFRVNSL PSRYGEKICL RLLDSSAPQL GLDKLISNPS ALSLVRNLGS KPFGMILVTG PTGSGKSTTL YSLLAERNEP GINISTVEDP IEYTLPGITQ CQVNREKGFD FSTALRAFMR QDPDVLLVGE TRDLETAKTA IEAALTGHLV LTTLHCNDAS SAIARLNEMG VEPFMVSASL IGIVSQRLLR RVCRSCRKSY HPTEKELGRF GLMAHTETGV TFFKAHHHGQ EKQPCPNCQG SGYKGRVGVY EVLRMNEELA TAVAKGATTD LVRRLALEAG MKTLLGYSLD LVREGHTTLE EVGRMILTDS GLESERRARA LSTLTCNVCG AGLQDGWLEC PYCLTARQ
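The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand of the replicon). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so a standard codon table is used here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()   # the record groups bases in blocks of 10
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record
print(translate("ATGCTGACCG TTTGCAAATT CACGTCTAGA"))  # MLTVCKFTSR
print(translate("CGCCAATGA"))                          # RQ
```

The two outputs match the first ten residues (MLTVCKFTSR) and the final two residues (RQ) of the protein sequence, with the terminal TGA read as the stop codon.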