Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_03391 |
Symbol | mviN |
ID | 4779980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 312684 |
End bp | 314291 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 640083606 |
Product | hypothetical protein |
Protein accession | YP_001014168 |
Protein GI | 124025052 |
COG category | [R] General function prediction only |
COG ID | [COG0728] Uncharacterized membrane protein, putative virulence factor |
TIGRFAM ID | [TIGR01695] integral membrane protein MviN |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0314279 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAAAT CAATCAAAGA AATTGCTTTC GTCGTAAGTT TAGGAACTTT ATTGAGCAAA TTTGGAGGGA TGGCTAGACA ATTAGTAATT GCTGGTGCTT TCGGAATTAG TGCTGCATAT GATGCATATA ATTATGCTTA TATTATACCT GGATTTTTTT TAGTTCTGTT AGGGGGTATT AATGGTCCAT TGCATAACTC AATGGTTACA CTATTGGCAG ACAAAAATAA AGTTGACAGT AGATTATTTA TAAGTTCAAT CAATAATATT TTATCTATAA TACTATTGAT TATAAGTTTA TTTATTTTCT TTTCATCTGA TTTTTTGATT AATTTGGTTG GACCGAGTTT AATACCTGAA ATAAAAGAAA TAGCATCCTA TCAATTAAAA ATAATGTCTC CAATAATCTT CCTATCTGGA CTGATAGGTC TTGGATTTGG ATCACTAAAT ACTAAAAAAG AATTTTTCAT TCCCTCTATA TCTCCATTAA TTTCAAGTCT AATAATCATA ATTTCAATAT CAAACTTTTG GATAAACAAA GGAAATACGA CTGATCTAGA TGCACTGAAT ATCAGAGGAG GAATTATTTT AGCAAAGGCA ACATTTATAG GTGCCCTATC TCAATATTTA ATTCAAATAC CGTTCCTAAT TAGAAAAGGA ATATTTGCGA TAAGTTTTTC AATACAAACA AAATATTCAG AAATAAAAAG GGCCTTGCAA ATGATTGCGC CTGCTTCACT TTCCTCAGGA ATGATACAAA TTAATGTTTT TACTGATTTG TTCTTTGCAT CGAAAATAGT TGGAGCTGCA GCTGCCTTAA GTTACGCAAA CTTTTTAGTT CAAGCACCTC TTGGAATAGT ATCAAATTCT ATTTTAATTC CATTATTACC GGTTTTTGTA AGTCTAAGAG CTCGAGAAAA TCATTTAAAA TTGATCAAAA AAATCCATCA GGGATTAATT CTTTCTTCGA CTTCTATGGT ATTTTTAGGG TCGTTATTTA TTTCACTTTC TACTCCAATA GTTGTATTAA TCTATGGTAG AGGTTCATTC AATGAAAATG CGATTGATGT AGTAAGTCAA CTATTAATTG CATATGGAAT AGGAATGCCT TTTTATCTAT GTAGAGATCT ATTGGTAAGA GTTTTTTATG GTATAGAGGA TGCAAGAACA CCATTTAGAA TATCAATCAT AGCAATTTTA CTAAATTTAT TTTTTGACTG GTTTTTCATA GGAGGTTCAA GTCCATGGGG GGAGCTATCA CCGCTAAACT TAGGAGTAAA TGGATTAGTC TTTTCAACTA CATTTGTTAA CTTCTTCGCT TGTACACTTT TACTATTTAA ATTAAATAAT AGATTAGATA ATCTAAATTT GTCTAATCTA TTATCTCAGA ATCTGAGAAT TATTCTAATT GGTTTAATTT CTGGTATATG CTCATTCTTT ATTTTTAAAA TAATATTTTT ACCCTATAGT TTTATAAATT TATTATTGAA ATTAATAATA TCATCTGGAA TTAGTCTGAT TATCTTTTAT TGCCTAGCAA TTATTCTTAA AATTGATCAT ATTAATAATT TAAACAAGTT TTTAAAAGAG AAGTTTATTC GTCTTTAA
|
Protein sequence | MSKSIKEIAF VVSLGTLLSK FGGMARQLVI AGAFGISAAY DAYNYAYIIP GFFLVLLGGI NGPLHNSMVT LLADKNKVDS RLFISSINNI LSIILLIISL FIFFSSDFLI NLVGPSLIPE IKEIASYQLK IMSPIIFLSG LIGLGFGSLN TKKEFFIPSI SPLISSLIII ISISNFWINK GNTTDLDALN IRGGIILAKA TFIGALSQYL IQIPFLIRKG IFAISFSIQT KYSEIKRALQ MIAPASLSSG MIQINVFTDL FFASKIVGAA AALSYANFLV QAPLGIVSNS ILIPLLPVFV SLRARENHLK LIKKIHQGLI LSSTSMVFLG SLFISLSTPI VVLIYGRGSF NENAIDVVSQ LLIAYGIGMP FYLCRDLLVR VFYGIEDART PFRISIIAIL LNLFFDWFFI GGSSPWGELS PLNLGVNGLV FSTTFVNFFA CTLLLFKLNN RLDNLNLSNL LSQNLRIILI GLISGICSFF IFKIIFLPYS FINLLLKLII SSGISLIIFY CLAIILKIDH INNLNKFLKE KFIRL
|
| |