Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_27201 |
Symbol | |
ID | 4777877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2397284 |
End bp | 2398492 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640088243 |
Product | NifS-like aminotransferase class-V |
Protein accession | YP_001018715 |
Protein GI | 124024408 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGACA GAGCAACGGA ACTTTCCACC ATGCTGCCAA GCCTTTCCAG TGGATCCCTA AAACAATCCC CCCTTTGCTT GGATTATCAG GCAACCACAC CCTGTGCTGC CGAGGTTGTA AAGGCCATGG CCGCCTACTG GAGCGAGGAC TGGGGCAATG CCTCAAGCCG CCAACATCGC TCTGGATTGA AGGCCGCAGC CGCAGTGAGC CTCGCTCGAG AGCAACTGGC CTGCCATCTG CGCGTCACAC CGCAACGCGT GATCTTTACG AGCGGTGCCA CAGAAGCTAA TAACCTGGCG CTACTAGGCC ATGCCAGAGC AAGAGCAGAA CAACGCGGAG CACCAGGCCA CCTGATCACG CTGGTTACTG AACACCATGC CGTACTCGAT CCGCTGCGCC AACTGCAAAA GGAAGGTTTC CGACTAACGG AACTACAGCC ACGGGCGGAT GGTCTGCTGC GACCAGAGCA ACTGGCTGAG GCATTTGAAA ACGACACGCT GCTCGTGAGC GTGATGGTCG CCAACAATGA AACGGGCGTC ATCCAACCCC TAGCAGAGCT GTCAGAGCTT TGCCGGAAAC GTGATGTAGT TCTGCATAGC GATGCCGCGC AAGCCTTTGG TCACGTCCAT CTCGATCCAG ATGCTCTCGG GCTCGATCTG CTCAGCATCA GCGGCCACAA GCTTTACGGC CCTAAAGGGA TCGGCGCACT CGTCGTACGA CCTGAAGTGC CAATCAACCC CTTGCAATGG GGTGGAGGCC AAGAGCAGGG CCTAAGGCCT GGGACACTAC CCGTGCCATT AATTGTGGGC CTCGCTAAAG CCGTAGAGCT AGCCATGGAG GACATCAAGA GCCGCCAAGA CAAGCTCTGC ACGCTACGCA ATCAACTCTG GGATGGTTTA CGAGAACGCC TCCCCGATCT CATCCTCAAT GGATCGCTAG AGCACAGGCT GCCTCACAAT CTCAACATCA CAATTCCAGG AGTGCGCGGC AGCAGCCTGC ATCAACAATT GCGACCGCTC ATTGCTTGCA GCAGCGGCTC TGCCTGCAGC CAAGGAGCCC CATCCCATGT GCTGATGGCC TTAGGTCGCA CATCAGCTGA GGCGGAAGCC TCGCTTCGAC TGAGCTTGGG TCGCAACACC TCAAGCGAAG AGATCAGCCA GGCTGTGGAG TCGATTAGCA CTGTCGTGAC CGACCTGAGA GCTGGATAG
|
Protein sequence | MRDRATELST MLPSLSSGSL KQSPLCLDYQ ATTPCAAEVV KAMAAYWSED WGNASSRQHR SGLKAAAAVS LAREQLACHL RVTPQRVIFT SGATEANNLA LLGHARARAE QRGAPGHLIT LVTEHHAVLD PLRQLQKEGF RLTELQPRAD GLLRPEQLAE AFENDTLLVS VMVANNETGV IQPLAELSEL CRKRDVVLHS DAAQAFGHVH LDPDALGLDL LSISGHKLYG PKGIGALVVR PEVPINPLQW GGGQEQGLRP GTLPVPLIVG LAKAVELAME DIKSRQDKLC TLRNQLWDGL RERLPDLILN GSLEHRLPHN LNITIPGVRG SSLHQQLRPL IACSSGSACS QGAPSHVLMA LGRTSAEAEA SLRLSLGRNT SSEEISQAVE SISTVVTDLR AG
|
| |