Gene NATL1_03391 details

Gene Information

Locus tag: NATL1_03391
Symbol: mviN
ID: 4779980
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 312684
End bp: 314291
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 29%
IMG OID: 640083606
Product: hypothetical protein
Protein accession: YP_001014168
Protein GI: 124025052
COG category: [R] General function prediction only
COG ID: [COG0728] Uncharacterized membrane protein, putative virulence factor
TIGRFAM ID: [TIGR01695] integral membrane protein MviN
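
The length fields are consistent with the coordinates. A minimal sketch of the arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted as a residue (the record states neither convention explicitly):

```python
# Values copied from the record above.
start_bp = 312684
end_bp = 314291

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1608 535
```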


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0314279
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGAAAT CAATCAAAGA AATTGCTTTC GTCGTAAGTT TAGGAACTTT ATTGAGCAAA 
TTTGGAGGGA TGGCTAGACA ATTAGTAATT GCTGGTGCTT TCGGAATTAG TGCTGCATAT
GATGCATATA ATTATGCTTA TATTATACCT GGATTTTTTT TAGTTCTGTT AGGGGGTATT
AATGGTCCAT TGCATAACTC AATGGTTACA CTATTGGCAG ACAAAAATAA AGTTGACAGT
AGATTATTTA TAAGTTCAAT CAATAATATT TTATCTATAA TACTATTGAT TATAAGTTTA
TTTATTTTCT TTTCATCTGA TTTTTTGATT AATTTGGTTG GACCGAGTTT AATACCTGAA
ATAAAAGAAA TAGCATCCTA TCAATTAAAA ATAATGTCTC CAATAATCTT CCTATCTGGA
CTGATAGGTC TTGGATTTGG ATCACTAAAT ACTAAAAAAG AATTTTTCAT TCCCTCTATA
TCTCCATTAA TTTCAAGTCT AATAATCATA ATTTCAATAT CAAACTTTTG GATAAACAAA
GGAAATACGA CTGATCTAGA TGCACTGAAT ATCAGAGGAG GAATTATTTT AGCAAAGGCA
ACATTTATAG GTGCCCTATC TCAATATTTA ATTCAAATAC CGTTCCTAAT TAGAAAAGGA
ATATTTGCGA TAAGTTTTTC AATACAAACA AAATATTCAG AAATAAAAAG GGCCTTGCAA
ATGATTGCGC CTGCTTCACT TTCCTCAGGA ATGATACAAA TTAATGTTTT TACTGATTTG
TTCTTTGCAT CGAAAATAGT TGGAGCTGCA GCTGCCTTAA GTTACGCAAA CTTTTTAGTT
CAAGCACCTC TTGGAATAGT ATCAAATTCT ATTTTAATTC CATTATTACC GGTTTTTGTA
AGTCTAAGAG CTCGAGAAAA TCATTTAAAA TTGATCAAAA AAATCCATCA GGGATTAATT
CTTTCTTCGA CTTCTATGGT ATTTTTAGGG TCGTTATTTA TTTCACTTTC TACTCCAATA
GTTGTATTAA TCTATGGTAG AGGTTCATTC AATGAAAATG CGATTGATGT AGTAAGTCAA
CTATTAATTG CATATGGAAT AGGAATGCCT TTTTATCTAT GTAGAGATCT ATTGGTAAGA
GTTTTTTATG GTATAGAGGA TGCAAGAACA CCATTTAGAA TATCAATCAT AGCAATTTTA
CTAAATTTAT TTTTTGACTG GTTTTTCATA GGAGGTTCAA GTCCATGGGG GGAGCTATCA
CCGCTAAACT TAGGAGTAAA TGGATTAGTC TTTTCAACTA CATTTGTTAA CTTCTTCGCT
TGTACACTTT TACTATTTAA ATTAAATAAT AGATTAGATA ATCTAAATTT GTCTAATCTA
TTATCTCAGA ATCTGAGAAT TATTCTAATT GGTTTAATTT CTGGTATATG CTCATTCTTT
ATTTTTAAAA TAATATTTTT ACCCTATAGT TTTATAAATT TATTATTGAA ATTAATAATA
TCATCTGGAA TTAGTCTGAT TATCTTTTAT TGCCTAGCAA TTATTCTTAA AATTGATCAT
ATTAATAATT TAAACAAGTT TTTAAAAGAG AAGTTTATTC GTCTTTAA
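
The GC content field can be recomputed from a sequence in this layout. The sketch below is illustrative only: `gc_percent` is a hypothetical helper name, and it assumes the sequence contains only A, C, G and T separated by whitespace, as in the blocks of ten bases shown above.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above; pass the full sequence to
# compare the result against the GC content reported in the record.
first_line = "ATGTCGAAAT CAATCAAAGA AATTGCTTTC GTCGTAAGTT TAGGAACTTT ATTGAGCAAA"
print(round(gc_percent(first_line), 1))
```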
 
Protein sequence
MSKSIKEIAF VVSLGTLLSK FGGMARQLVI AGAFGISAAY DAYNYAYIIP GFFLVLLGGI 
NGPLHNSMVT LLADKNKVDS RLFISSINNI LSIILLIISL FIFFSSDFLI NLVGPSLIPE
IKEIASYQLK IMSPIIFLSG LIGLGFGSLN TKKEFFIPSI SPLISSLIII ISISNFWINK
GNTTDLDALN IRGGIILAKA TFIGALSQYL IQIPFLIRKG IFAISFSIQT KYSEIKRALQ
MIAPASLSSG MIQINVFTDL FFASKIVGAA AALSYANFLV QAPLGIVSNS ILIPLLPVFV
SLRARENHLK LIKKIHQGLI LSSTSMVFLG SLFISLSTPI VVLIYGRGSF NENAIDVVSQ
LLIAYGIGMP FYLCRDLLVR VFYGIEDART PFRISIIAIL LNLFFDWFFI GGSSPWGELS
PLNLGVNGLV FSTTFVNFFA CTLLLFKLNN RLDNLNLSNL LSQNLRIILI GLISGICSFF
IFKIIFLPYS FINLLLKLII SSGISLIIFY CLAIILKIDH INNLNKFLKE KFIRL
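
The protein sequence can be checked against the gene sequence by translating codons. The sketch below builds the codon table from the usual TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons, which this sketch does not model. `translate` is a hypothetical helper name.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the amino acids
# assigned by translation table 11 ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGTCGAAAT CAATCAAAGA AATTGCTTTC"))  # MSKSIKEIAF
```

The output matches the first ten residues of the protein sequence above.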