Gene Information |
Locus tag | NATL1_01971 |
Symbol | |
ID | 4779127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 182340 |
End bp | 183911 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640083461 |
Product | hypothetical protein |
Protein accession | YP_001014026 |
Protein GI | 124024910 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
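The coordinate and length fields in this table can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that the start and end positions are 1-based and inclusive, and that the final codon of the gene is a stop codon not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 182340
end_bp = 183911
gene_length_bp = 1572
protein_length_aa = 523

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

coordinates_consistent = (span_bp == gene_length_bp)
lengths_consistent = (expected_protein_aa == protein_length_aa)

print("span matches gene length:", coordinates_consistent)
print("codon count matches protein length:", lengths_consistent)
```

Under these assumptions both checks hold for the values listed in this record.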
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATATTGA CCGAGAGCAT TTCCCACAAC TGTAGGCCTC CAATTTTATG GATTTTTCTT TTTTGGACCA TAGCTTGTGG AATTGCATTT GTCTCGCTGG GTAATTTGCC TTTAAGAGAT TTTGATGAAG CAACTGTCGC AAGAGTTGCA TTGGAATTAA ACCAAAAAAG TGGACTAGAA CGATTGCTTC CTTCTATCTG GGATAAGCCC TATTTGAATA AACCCCCAGG GTTGCATTGG ATAATATCTT TTGCAATTGG AATTAGTAGA AGTTTCCAAA ATAATTTTGA TTTTTTACCC TCTGAGTTTT GTATAAGATT TTTCCCAGCA CTATTTTCGA CTTTTGTTGT TCCATTGGGT GGTCTTATTC AGTGGAACTT GCGCCCAAAA GATCGAATAG CATGCATAAC TACATCAGCA ATTTTATTGA CTTTGTTGCC AATTATTAGA TATGGGCGAA TGGCAATGTT GGATGGAACG CAGCTTAGTG CTATTGCACT TTTATGGTTT TGCTTATCAT CTATAAAAAA TAATAGGCCA ACTAATTTTA ATTTTTTAGG AGCTGGATTT ACATGTAGTT TCATGCTTTT ACTTAAAGCC CCTGTAATTA TTCCTGCACT ATTTGCATCT TTATTACCTT TAATTTGGGA ATATAATTCA AAAAAATATT TTAATAATTT TTCATGGTCT TGGTTCTTCT ATGGACTAAT TCCAGGTTTT GCTTGGCATG TATGGAATTT CATTTCATAT GGTTCAGGAG CTTTTTGGTT GTGGTGGGGA GATGGGGCAG GAAGAGTTTT ATTTGAAAAA GGCTCAGGTA GCGAGCTAGG AGTTTTGGTA CCAATAATTG AAATACTTGA AGGGGGATGG CCTTGGATTC TTCTATGGCC AATTGGTTTT TTGTGGGCAT GCTTGAGCCT TAATACTCGT TGGGGAGTTT GGGCTTTAAG TACTCAGATT ATTATTTTAG GAAGTATTTT ACCTCTAAAA ATGCAACTTC CTTGGTATAT TCATCCATTT TGGTTGCCCT TTGCTTTGGT ATGCGGACCT CCAGTTTCTT GGTTAATTCA AAGAGAAGAG AACGGTTATA TTTTCGCTAG AAAAATTTTA AGAAAAATCC CATATATATT TTCTTTAATT GGACTATGCA TATTGACTTT TTCTTTATTA CTTAAGTTAA ATATTTTCAA CATTGGAGAA GGTTACTTTT ATTCAATTTT CTTTATAAGT TTAGCTTGGT TTATTGGGGG ATTATTATTA TCTAATTCAA GAAAGAATAT TAGAAAAATT GGTTTTATTG GATTGATTGT TGGAAGCATA ATAGGCTTAT TCTTTTTTGT GAGTTCAAAA TTTTGGTTAT GGGAAATAAA TGAAAATTGG GATGTAAGAC CTGTAGCTGA ATTTATAGAT GGCTTTCCTA ATCAACAAAT TTTTATCAAA AATAGCTTTG AGAGACCAAG TTTAAATTGG TATGCAGGAA AACAAATCAA AAGTTTTGAC GAGGAAGATA AAACTAAATG CAAAGTAATT AAGAAAACTA ATGCTTGGGA TCTCTATACA TGTAATGATT AA
|
Protein sequence | MILTESISHN CRPPILWIFL FWTIACGIAF VSLGNLPLRD FDEATVARVA LELNQKSGLE RLLPSIWDKP YLNKPPGLHW IISFAIGISR SFQNNFDFLP SEFCIRFFPA LFSTFVVPLG GLIQWNLRPK DRIACITTSA ILLTLLPIIR YGRMAMLDGT QLSAIALLWF CLSSIKNNRP TNFNFLGAGF TCSFMLLLKA PVIIPALFAS LLPLIWEYNS KKYFNNFSWS WFFYGLIPGF AWHVWNFISY GSGAFWLWWG DGAGRVLFEK GSGSELGVLV PIIEILEGGW PWILLWPIGF LWACLSLNTR WGVWALSTQI IILGSILPLK MQLPWYIHPF WLPFALVCGP PVSWLIQREE NGYIFARKIL RKIPYIFSLI GLCILTFSLL LKLNIFNIGE GYFYSIFFIS LAWFIGGLLL SNSRKNIRKI GFIGLIVGSI IGLFFFVSSK FWLWEINENW DVRPVAEFID GFPNQQIFIK NSFERPSLNW YAGKQIKSFD EEDKTKCKVI KKTNAWDLYT CND
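The gene sequence begins with TTG rather than ATG, while the protein sequence begins with M. Translation table 11 permits TTG as an initiator codon, and an initiator codon is read as methionine. The sketch below illustrates this on the first 30 bases only. The codon dictionary is deliberately partial (just the codons in that prefix), and `translate_prefix` is a hypothetical helper, not a function from any library.

```python
# Partial codon table: only the codons that occur in the first 30 bases
# of the gene sequence. A full translation would need all 64 codons.
PARTIAL_CODON_TABLE = {
    "TTG": "L", "ATA": "I", "ACC": "T", "GAG": "E", "AGC": "S",
    "ATT": "I", "TCC": "S", "CAC": "H", "AAC": "N",
}


def translate_prefix(dna: str) -> str:
    """Translate a short CDS prefix, treating the first codon as a start codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = [PARTIAL_CODON_TABLE[codon] for codon in codons]
    if residues:
        # Assumption: the first codon is the initiator, which is read as
        # methionine even though TTG normally encodes leucine.
        residues[0] = "M"
    return "".join(residues)


# First three 10-base groups of the gene sequence in this record.
gene_prefix = "TTGATATTGA CCGAGAGCAT TTCCCACAAC"
protein_prefix = translate_prefix(gene_prefix)
print(protein_prefix)
```

The result can be compared by eye with the first ten residues of the protein sequence listed in this record.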