Gene A9601_02831 details


Gene Information

Locus tag: A9601_02831
Symbol: mviN
ID: 4716969
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 260056
End bp: 261639
Gene Length: 1584 bp
Protein Length: 527 aa
Translation table: 11
GC content: 29%
IMG OID: 640077984
Product: hypothetical protein
Protein accession: YP_001008678
Protein GI: 123967820
COG category: [R] General function prediction only
COG ID: [COG0728] Uncharacterized membrane protein, putative virulence factor
TIGRFAM ID: [TIGR01695] integral membrane protein MviN
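
The coordinate and length fields listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a gene length that includes the stop codon. Both assumptions reproduce the values in this record, but neither is stated by the record itself.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS.

    Assumes the final codon is a stop codon and is counted in the
    gene length, so it is subtracted from the codon total.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(260056, 261639)
print(length, protein_length(length))
```

With the Start bp and End bp values from this record, the two results agree with the listed Gene Length and Protein Length.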


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATTCAT TTTTAAAAAA TAATGTTTTT TCAATTTCGT TTGGTACCAG TCTAAGTAAA 
TTAGCTGGAT GCATAAGACA AATATTTATA GCTGCTGCTT TTGGGGTTGG CGTAACATAC
GACGCCTTTA ATTATGCTTA TATAATTCCT GGTTTTTTGC TAATAATCAT TGGGGGTATT
AATGGTCCAT TGCATAACGC AGTAGTTGCA GTTTTAACTC CGCTTAACAA AAAAAATGGA
GGGATTGTTT TAACTCAAGT AAGCATAAAA CTTTCAATAT TATTATTAAT CTTAGCGATA
TTTATTTATT CGAATTCCAG TTTATTAATT GATTTATTGG CCCCGAATTT AAGTTACGAA
ACTAAATCTA TTGCCACTTA CCAATTACAA ATACTTACAC CTTGCATCCC TTTGTCCGGC
TTCATAGGTT TAAGCTTTGG CGCCTTAAAT TCCCAAAGAA AATTCTTTTT ATCAAGCATA
AGTCCAGCAA TAACAAGCGT AACTATTATT TTTTTTATTT TATTTAATTG GATTTTCAAC
CAAGAAAATA CATCTTCTAA TTTTTTTGCT TATTCGGGAT TACTCGCATT TGCAACTTTG
ACAGGAACTT TAATTCAGTT TGTTGTTCAA ATTTGGGAAA TAAATAAAAT TGGTCTATTG
AGATTAGAGT CAACCTTCAA TTTATTTAAA GATGAAGAGA GGAGAATTTT CAAACTAATT
ATTCCAGCCT CTATCTCATC AGGTCTAAGT CAAATTAATG TTTTTATCGA TATGTTTTTC
GCTTCAAGTT TTCAAGGCGC AGCATCTGGA CTAGCTTACG GAAACTTTCT TATACAAGCC
CCCTTAGGGA TATTATCTAA CTCCTTGATT CTGCCATTAC TTCCAAAATT TTCTAAATTG
AGAAGTGAAA AAGACGAAAG AAGTCTCCAA AAAAAATTGA TAACTGGGGT AGAGTACTGT
TTCTTGACAG CTATTTTTTT AACCGGGTTT TTTATAACAT TCAATAATCA AATCGTACAA
TTAGTTTTTC AAAGAGGATC TTTTGATTAT TCAGCAACTT TAAAAGTAAA GAATATATTA
ATTGCTTATG CAGTTGGCAT ACCTTTTTAT CTTTATAGAG ATTTATTAGT AAGAACTTAC
TATTCAATTG AAAAAACCAA CTTCCCTTTT AAGTCTTCAT TTGCAGGGAT AATATTTAAT
ATTATTTTTG ATTGGTTTTT AATTGGTGCC CCAATTAAGA ATTTTGGGAA TCTTTCTCCT
TATAATTTTG GAGTCGTGGG AATAATTTTA TCTTCAGTAA TAGTAAATCT TATAGTTTGT
ATTTTTCTTT CTTTCAATTT GCGCAAAGAA AATATCATTT TGCCTAACCT GGAATTATTG
AGGAAAATTA GCCTCATGTC ATTAGCAACA TTTATAGACA GCACAATTTG TTTTACTATT
CTCCAAACTA CCAATAACTT CAATTCAAAT CTCGCGGAAT TTTTAATATT AATATTTGGA
ACTCTAACTT TTTTTGTGAT TTATTTTTTA CTTACAAAAT GCTTGAAAGT AAATAAATTT
AAAGTTTCAA AAAAAATGAT TTAA
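
The gene sequence above is printed in space-separated blocks of ten bases, so it needs light cleaning before any computation. The following is a minimal sketch with hypothetical helper names: it strips the whitespace, checks the start and stop codons, and computes GC content as the percentage of G and C bases. Only the first and last lines of the sequence are used here to keep the example short. The GC content field in this record refers to the whole gene, so the figure for a single line will differ from it.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGCATTCAT TTTTAAAAAA TAATGTTTTT TCAATTTCGT TTGGTACCAG TCTAAGTAAA "
last_line = "AAAGTTTCAA AAAAAATGAT TTAA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head), head[:3], tail[-3:])  # block length, start codon, stop codon
print(round(gc_percent(head), 1))      # GC percentage of the first line only
```

To analyze the full gene, the same two helpers can be applied to all sequence lines joined into one string.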
 
Protein sequence
MHSFLKNNVF SISFGTSLSK LAGCIRQIFI AAAFGVGVTY DAFNYAYIIP GFLLIIIGGI 
NGPLHNAVVA VLTPLNKKNG GIVLTQVSIK LSILLLILAI FIYSNSSLLI DLLAPNLSYE
TKSIATYQLQ ILTPCIPLSG FIGLSFGALN SQRKFFLSSI SPAITSVTII FFILFNWIFN
QENTSSNFFA YSGLLAFATL TGTLIQFVVQ IWEINKIGLL RLESTFNLFK DEERRIFKLI
IPASISSGLS QINVFIDMFF ASSFQGAASG LAYGNFLIQA PLGILSNSLI LPLLPKFSKL
RSEKDERSLQ KKLITGVEYC FLTAIFLTGF FITFNNQIVQ LVFQRGSFDY SATLKVKNIL
IAYAVGIPFY LYRDLLVRTY YSIEKTNFPF KSSFAGIIFN IIFDWFLIGA PIKNFGNLSP
YNFGVVGIIL SSVIVNLIVC IFLSFNLRKE NIILPNLELL RKISLMSLAT FIDSTICFTI
LQTTNNFNSN LAEFLILIFG TLTFFVIYFL LTKCLKVNKF KVSKKMI