Gene NATL1_00161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_00161 
SymboldnaJ 
ID4779788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp19556 
End bp20686 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content40% 
IMG OID640083279 
Productchaperone protein DnaJ 
Protein accessionYP_001013845 
Protein GI124024729 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain 
TIGRFAM ID[TIGR02349] chaperone protein DnaJ 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.296062 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.971864 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGATT TTTACGATCT ATTGGGTGTC AGCAGAGATG CTGATGCTGA CACTTTAAAA 
AGAGCTTATA GACAGCAAGC TCGGAAATAT CACCCTGACG TCAATAAGGA AGCAGGTGCA
GAGGATAAGT TCAAAGAAAT AGGCAAAGCA TATGAAGTTT TAAGCGACTC TCAAAAGCGA
GCTCGTTACG ACCAATTTGG AGAAGCTGGA ATAGGTGGGG CCGCTGGCAT GCCGGATATG
GGAGATATGG GTGGCTTTGC AGATCTGTTT GATACCTTTT TTAATGGCTT TGGTGGTGCT
AGTTCAGCTG GAGGTTCTCG CCCTCAAAGA CGCGGACCAC AACAGGGAGA CGATTTACGT
TACGACCTAA CGATTGATTT TGATAAAGCT ATTTTTGGAC AAGAAAAAGA GATTACGGTC
CCTCATTTAG AAACTTGTGA TGTTTGCAGA GGCACTGGGG CTAAGAAAGG CACTGGTCCT
GTTACTTGTT CTACATGTAG TGGTGCAGGT CAAGTAAGGA GAGCGACTCG TACACCTTTT
GGAAGTTTTA CTCAAGTAGC TGAATGCCCA ACCTGTGGTG GTACTGGACA AGTGATTAAA
GATCCTTGCA ACGCTTGTGG AGGGAAAGGC GTTAAACAAG TAAGAAAAAA ATTAAAAATT
AATATTCCTG CTGGAGTTGA TAGCGGAACA CGATTAAGAG TTTCAGGAGA GGGTAATGCT
GGATTAAAGG GTGGTCCATC TGGAGATCTA TATGTTTTTT TAAAAGTTAA AAATCATCCT
AATTTAAAGA GAGATGGATT GACAATTTTA TCTGAGGTTA ATATTAGTTA CCTTCAGGCA
ATTTTAGGAG ATACTATTGA AATAGAGACT GTAGATGGCC CTACTAAGTT GCAAATTCCA
GCAGGGACCC AACCTAACTC TATTTTGAAT TTAGAAAATA AAGGAGTGCC AAAACTAGGC
AATCCAGTTG CTAGAGGTAA TCATCAAGTC TCAGTAAAGA TTAAATTACC TACAAAATTA
TCAGATTCTG AAAGAAATTT ATTAGAAGAA TTAGCTGGAC ATTACTCTGC ACTTGGACCA
CAACATCATT ATCATAAAAG TGGCTTATTT AGTAAGTTAT TTGGCAAATA A
 
Protein sequence
MADFYDLLGV SRDADADTLK RAYRQQARKY HPDVNKEAGA EDKFKEIGKA YEVLSDSQKR 
ARYDQFGEAG IGGAAGMPDM GDMGGFADLF DTFFNGFGGA SSAGGSRPQR RGPQQGDDLR
YDLTIDFDKA IFGQEKEITV PHLETCDVCR GTGAKKGTGP VTCSTCSGAG QVRRATRTPF
GSFTQVAECP TCGGTGQVIK DPCNACGGKG VKQVRKKLKI NIPAGVDSGT RLRVSGEGNA
GLKGGPSGDL YVFLKVKNHP NLKRDGLTIL SEVNISYLQA ILGDTIEIET VDGPTKLQIP
AGTQPNSILN LENKGVPKLG NPVARGNHQV SVKIKLPTKL SDSERNLLEE LAGHYSALGP
QHHYHKSGLF SKLFGK