Gene NATL1_05251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_05251 
Symbolqri7 
ID4780725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp476784 
End bp477854 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content36% 
IMG OID640083800 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_001014352 
Protein GI124025236 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.110591 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAATAA TTTTATCCCT CGAAACAAGT TGTGACGAGT CTGCAGCGGC TTTGGTTTCT 
AATGAAAAAG GAAAAATTGA TTTGTTAGCT AATGAAATAG CTTCACAAAT TGATGAACAT
GCTAATTGGG GTGGTGTTGT TCCAGAAATC GCTTCAAGAA GACATTTAGA AAACCTTCCA
TTTTTGATTG AAGAGGTTTT TGCAAAATCA ACATTACAGA TAAAAGATAT AGATGCAGTA
GCCGCAACTG TTACTCCAGG ATTAGCAGGA TCACTGTTGG TCGGATCAAT TACGGCAAGA
ACTTTAGCTA ATTTACATCA AATACCATTC TTAGGTATCC ATCACTTGGA GGGACATCTT
TCCTCAATAT ATTTGTCAGA AAACCATCCC AAACCCCCTT TTTTAGTCTT ATTGGTTAGT
GGAGGACACA CTGAATTGAT AAAAGTAGAT GTTAAACATA AGTATCAACG TCTTGGTAGA
AGTCATGATG ATGCAGCTGG AGAAGCTTTT GATAAGGTTG CAAGACTTTT GGGACTTTCA
TATCCAGGGG GCCCTGCAAT TCAAAAAATA GCTAAATCGG GAGACCCAAA AAAATTTTTA
TTTCCAAAAG GAAGAGTCTC TAAACCTGAA GGTGGTTTTT ATCCATATGA CTTTTCTTTT
AGTGGTTTAA AAACGGCTGT ATTTCGCCAG ATAGAAAAAA TTAGATCAGA AAATAAAAAA
TTACCAATAG AAGATATTGC TGCAAGTTTT GAATACATAG TGGCTGAAGT CTTAGTAGAG
AGGAGCTTTA AATGTGCCCT TGACCAAGGT TTAAATTCTC TTGTTTTAGT TGGAGGAGTT
GCTGCAAATG TGAGATTAAG GGAAATGATG CTTGCAAAAG CATCTAAAAA TTCAATTGAT
ATTACTCTTG CACCAATGGA ATTTTGTACT GATAATGCGG CAATGATTGG GGCGGCAGCT
TTGTTAAGAT TATCGTCTGA AGGCTTTAAA AGTTCAATGG AATTAGGTGT ATCTGCTCGT
TGGCCACTAG AAAAATCTGA TTCACTTTAT GATCCGATTC CTCCTTTTTA A
 
Protein sequence
MSIILSLETS CDESAAALVS NEKGKIDLLA NEIASQIDEH ANWGGVVPEI ASRRHLENLP 
FLIEEVFAKS TLQIKDIDAV AATVTPGLAG SLLVGSITAR TLANLHQIPF LGIHHLEGHL
SSIYLSENHP KPPFLVLLVS GGHTELIKVD VKHKYQRLGR SHDDAAGEAF DKVARLLGLS
YPGGPAIQKI AKSGDPKKFL FPKGRVSKPE GGFYPYDFSF SGLKTAVFRQ IEKIRSENKK
LPIEDIAASF EYIVAEVLVE RSFKCALDQG LNSLVLVGGV AANVRLREMM LAKASKNSID
ITLAPMEFCT DNAAMIGAAA LLRLSSEGFK SSMELGVSAR WPLEKSDSLY DPIPPF