Gene NATL1_06381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_06381 
Symbol 
ID4779339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp578598 
End bp579584 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content36% 
IMG OID640083916 
Productnucleoside-diphosphate-sugar epimerases 
Protein accessionYP_001014465 
Protein GI124025349 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0655453 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.975907 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAAAACG AAATAATTCT GATTACTGGA GCAAGTGGAT GTGTTGGGCA ATACATAGCA 
AATTGGCTAA TCGAAAACTC AACTTCAGAA TTATTTTTAT GGGTTAGAGA TCCTAAAAAA
ATAACTTCAA TAAATTTAGA AAACCCAAGG ATAAAAATTT TAGTCGGAGA TTTGAGAGAA
TCAAATAAGT TCAAGAAAGA AATTTCAGAA GTCAACAGAG TTATTCATAC TGCGACTGCT
TGGGGTGATC CTAAAAGGGC GAAAGAAGTC AATATTGATG CAGTAAAAAA TTTGCTCAAT
TTACTAAATC CCTCCAATAT CAAACAAATT ATTTATTTCT CAACTGCAAG TGTTCTTGAC
AGAAACTTAA ATTTGTTACC GGAAGCTTTT ACCTATGGAA CAGAGTACAT ACAAACAAAA
GCACAATGCC TCAGAGAGCT TGAGTCTCAT CAGCTTGCAA CGAAGATCAT AGCTGTTTTC
CCAACACTAG TTTTTGGCGG ACGTTTAGAC GGTAAAAGTA AATTTCCAAC CAGCTATCTT
ACCGAAGGAC TTAGAGATGC ATTGAGATGG ATCTGGCTGG CTAGATGGAT AAAATTATCC
TCAAGGTTTC ATTTTATTCA CGCAGCAGAT ATCGCTTTCA TTTGCGGGCA TCTGGCTACT
TCTGATTTCG AGCCCATACA ACCTTTTTCT GCCACTAAAA TAAAAAAATT AGTTTTAGGT
CAACCCTATA CAAGTATTGA TGTAGTAATT CAGACGCTTT TAATATGGAA AGGAATGAGA
AGAGTCCCTC AAATCCCAGT CTTGAACTGG CTTATTGAAC TTTTAACTGT ATTACTACCA
ATTCAAATGA CAAACTGGGA TAGATTTAGT CTTAGACAAA AACACTTTAT ACATGAGCCC
GTAACCTCTC CTGAAACCTT CGGGGGTATA AGTCATGCCA AAACGCTAAG TCAAGTTTTA
CATAATTCTG GTTTAACTAA ACACTAA
 
Protein sequence
MKNEIILITG ASGCVGQYIA NWLIENSTSE LFLWVRDPKK ITSINLENPR IKILVGDLRE 
SNKFKKEISE VNRVIHTATA WGDPKRAKEV NIDAVKNLLN LLNPSNIKQI IYFSTASVLD
RNLNLLPEAF TYGTEYIQTK AQCLRELESH QLATKIIAVF PTLVFGGRLD GKSKFPTSYL
TEGLRDALRW IWLARWIKLS SRFHFIHAAD IAFICGHLAT SDFEPIQPFS ATKIKKLVLG
QPYTSIDVVI QTLLIWKGMR RVPQIPVLNW LIELLTVLLP IQMTNWDRFS LRQKHFIHEP
VTSPETFGGI SHAKTLSQVL HNSGLTKH