Gene NATL1_03991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_03991 
SymbolxylB 
ID4780902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp368016 
End bp369251 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content34% 
IMG OID640083668 
Productcarbohydrate kinase 
Protein accessionYP_001014228 
Protein GI124025112 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1070] Sugar (pentulose and hexulose) kinases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.14778 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAACA ATTCTTTTGT ACTTGGTATA GATCTTGGTA CATCAGGCGT AAGGATTGCA 
ATCATTAATA CTAAAAAAAA AATACTATTC ACATCATCAA CAAAATACTC TAAAGGTCTG
GAAATATCTG GAGACTGGAT AAATAGCCTC AAAAATCTAA TACAAGAAAT TCCAAAAGAT
CTTAAAGAAA AGTTGGTTTC TTGTTCAGTA GCAGGAACCT CTGGAACACT TTTAGCATGC
AATAGAGATG GAGTTCCTCT AGGGAAAGCC CTACCTTATT TCTTACCCTT TTCGGAATAT
TCATGCGAGA TAGAAAATCT ATTTACTAAA GAATGTGCAG GATCAAGTGT AAGTGGAAGT
GTTGGAAGAG CGCTAAAACT TCTAGCCTTA TATGGTAATG AAATAATCTT AAGGCACCAA
GCAGATTGGA TTAGTGGATG GCTTATCAAT AATTGGGAGT ATGGGGAAGA AGGTAACAAT
ATTAGGATGG GTTGGGAAAT ATCAAATAGT TCATGGCCAG AAAAATTTCA AAATTTAAAA
TGGTTAAAAT GTCTTCCGAA AATAATTCCT TCAGGTCAAA TAATGGGAAA TATATGTACT
AAAAAAGCAA ATGAATTAAG TTTACCAAAA AATCTTAAAG TCATAGCAGG AACTACAGAT
TCTAATGCTG GGGTTTTAGC TACTTTCCCT AATAAAAATG ATGGGATAAC AATCCTTGGT
AGCACAATAG TAATTAAAAA ATTTGTAAAT AACCCCTTGG AGGGGAAAGG TATTTCAAAT
CATAAATTGT TAGGGAATTG GCTATCTGGT GGAGCATCTA ATACAGGGGC TTCGATACTA
CTAGACTTCT TTAATCTTGA ATATATTGCA GAATTAAGCA AACAAATAAA TCCTAATAAA
TCATCAGGAT TAAACCTTCT TCCATTGTCA AGTCAAGGAG AAAGATTTCC AATAGATGAC
CCCAATTTAC AACCTAAACT TGAGCCAAGA CCAGTCAGTG ATTCTCTTTA TCTTCATGCA
TTATTTGAAG GGTTAGCGAA AATAGAAGCA AGAGGCTGGC AAAAACTTAA TGAATTAGGA
GCTGATTTAC CTCGGCAAAT AATTACTATT GGAGGAGGTG CAAAAAATAT TACTTGGAAA
AAAATAAGAG AAAGAGAAAT TGGCATACCA ATAAAAATAT GCAACACCCC CCCCGCTGCT
GGAGTAGCAA GTATTGCTTT GCAGGGATTA TTATGA
 
Protein sequence
MLNNSFVLGI DLGTSGVRIA IINTKKKILF TSSTKYSKGL EISGDWINSL KNLIQEIPKD 
LKEKLVSCSV AGTSGTLLAC NRDGVPLGKA LPYFLPFSEY SCEIENLFTK ECAGSSVSGS
VGRALKLLAL YGNEIILRHQ ADWISGWLIN NWEYGEEGNN IRMGWEISNS SWPEKFQNLK
WLKCLPKIIP SGQIMGNICT KKANELSLPK NLKVIAGTTD SNAGVLATFP NKNDGITILG
STIVIKKFVN NPLEGKGISN HKLLGNWLSG GASNTGASIL LDFFNLEYIA ELSKQINPNK
SSGLNLLPLS SQGERFPIDD PNLQPKLEPR PVSDSLYLHA LFEGLAKIEA RGWQKLNELG
ADLPRQIITI GGGAKNITWK KIREREIGIP IKICNTPPAA GVASIALQGL L