Gene NATL1_01621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_01621 
SymbolxseA 
ID4779985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp156443 
End bp157603 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content31% 
IMG OID640083426 
Productexonuclease VII, large subunit 
Protein accessionYP_001013991 
Protein GI124024875 
COG category[L] Replication, recombination and repair 
COG ID[COG1570] Exonuclease VII, large subunit 
TIGRFAM ID[TIGR00237] exodeoxyribonuclease VII, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.381719 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACTGATC TTAAAAACGC TAAACAATCT CTAAATACAT ATAGTGTTAA AGAATTAAAC 
GAATCTATTG GCTTATTATT ATCAAGAGGC TTTGCCCCAA AGTTTATAGT TGAAGCCACT
GTTTCTAAAT CGCAAATAAA AAAAGGTCAT TTATGGCTAA CTTTAACGGA CGGGAAAGCA
AGTGTAGATG CGGTTGCATG GTCATCAACA ATAAAGTCTT TAAAATTTTT ACCAAAGCAA
GATGATGGCG TTGTTATTAT TGGTAAATTA AATTTCTGGG AATCTCAAGC AAGAGTATCG
GTACAAGTTT TTGATATTCG ACCAAGTATT TCTACGGTTC TTAGGAAGTT CGAAATAGTC
AAATCAAAAC TTTTTAAAGA AGGTTTGATT GATGATTCGT TAAGAAAAAA ATTGCCAAAA
TATCCTCATT CAATTGGTAT CCTTACAAGT GTTCCAAGCT CTGCTTTAGC TGACATGCTT
AGAACAGCTA AGGAGAGATG GCCATTAACG AAGCTGCAAA TAATTCCTAT TCCAGTTCAA
GGTGATAATG CAAATAAACT AAAATCTATT TTAAGTAAAT TAAAAAAAAA TAAGTTAAAA
TTAGAGGCTT TAATTATAGC TAGAGGAGGA GGTAGCAGAG AAGATTTAAT GTTGTTTGAT
AGTGAAATCA TAGCTAGAGA AATCGCAACT TTCCCAATAC CTGTAATTAC AGGGATAGGT
CACGAAGATG ATCTAACAGT TGCTGATCTG GTTTCAGATC ATCGATCTGC CACTCCAACT
GCTGCGATTG TTGATCTATT GCCCTCAAGA GAAATTGAAA AAAATAAGTT TTTACAAAAT
AAAAAATTAC TTAAATATTA TTTGAAATTG TTTTTTCAGA ACGCAAAGAA ATCATTAATT
ACAAAAAAAT CTATTTTTCA ATCTTATTCA CCCCGACTAT TAATAAAAAA TAAAAGAACA
AGAATAAATT ATATGTATGA TATTTTGAAT GCACTTTCTC CAAGAAAATT GTTAAAAAGA
GGTTTTGCGC TAATTACTGA CGAGTCAGGT AATTCGATTT ATAGTGTAAA AAATATTAAG
GAAAATGATA AGCTGATAGT TCAATTTTGT GATGGAAAAA TTACAGCAGA GGTTGATAGT
CTTAATTATG ATAAAATATA A
 
Protein sequence
MTDLKNAKQS LNTYSVKELN ESIGLLLSRG FAPKFIVEAT VSKSQIKKGH LWLTLTDGKA 
SVDAVAWSST IKSLKFLPKQ DDGVVIIGKL NFWESQARVS VQVFDIRPSI STVLRKFEIV
KSKLFKEGLI DDSLRKKLPK YPHSIGILTS VPSSALADML RTAKERWPLT KLQIIPIPVQ
GDNANKLKSI LSKLKKNKLK LEALIIARGG GSREDLMLFD SEIIAREIAT FPIPVITGIG
HEDDLTVADL VSDHRSATPT AAIVDLLPSR EIEKNKFLQN KKLLKYYLKL FFQNAKKSLI
TKKSIFQSYS PRLLIKNKRT RINYMYDILN ALSPRKLLKR GFALITDESG NSIYSVKNIK
ENDKLIVQFC DGKITAEVDS LNYDKI