Gene NATL1_10011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_10011 
SymboluvrC 
ID4780129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp921564 
End bp923486 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content31% 
IMG OID640084279 
Productexcinuclease ABC subunit C 
Protein accessionYP_001014824 
Protein GI124025708 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.404154 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.496638 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAACTAA TACCGTTAAT AAGGGACAAG TCAAGATTAT CGGATTTTTT GAAGGATATA 
CCTAATGATC CTGGATGTTA TTTGATGAAA GATGGTGAGG ATAGATTGCT TTATGTTGGT
AAATCTAAAA AGTTAAGGAA TAGAGTTAGA AGTTATTTTC GTTCAGGTAA TGAATTAAGT
CCTAGAATAT CTTTAATGGT GAGACAAGTT GCAGATATTG AATTGATAGT TACTGATAAT
GAAAGTGAAG CATTAACATT AGAATCAAAT TTAATTAAAT CTCACCAACC ATATTTCAAT
GTCTTACTAA AAGATGATAA AAAGTATCCC TATGTTTGTA TTACTTGGGG TGATAAATAT
CCAAGAATTT TTTTAACTAG AAAAAGGCGT CAACGACAAT TAAAAGATAA ATATTATGGT
CCTTATGTAG ATGTTTATTT ACTTAGAAAA ACTCTATTTA GTATAAAAAA ATTGTTTCCA
CTCAGGCAAA GAAGAATTCC GCTTTATAAG GATAGAACAT GCCTTAATTA TTCAATTGGA
AGATGCCCTG GTGTTTGCCA GGAAGAAATA AGTTCAGAAG ATTACAAAAA CACTTTAAAA
AGAGTTGAAA TGATATTTCA AGGAAGAACG GATGAATTAA GAATATTATT AGAAAAACAA
ATGATTTCTT TTTCAGAGTC ATTGAAATTT GAAGAGGCTG GATCAGTTAG AGATCAGCTT
AAGGGTATAG ATAGATTGTA TGAATCTCAA AAGATGATCA TACCAGATTC ATCTGTTTGT
AGGGATATAA TTGCAATGGC ATCAGAAGAA AATATAAGCT CAGTACAAAT TTTTCAAATG
CGATCAGGTA AATTAATTGG TCGTTTAGGA TATTTCTCAG ATAATAGTAA TTTTAATTCA
TCTCAAATAC TTCAACAAGT AATAGAAAAT CATTATTCAA ATGTAGATCC TGTTGAAATC
CCATCAGAAA TATTAGTTCA ACATCAACTT GTAAATAATA TTTTAATTTC AGATTGGCTT
AGTGAAATAA AAAAGCAAAA AGTTAATATA AATGTTCCTA AAAGATCTAG AAAAGCAGAG
ATTATTAAAC TCGTAGAAAA AAATGCTAAT TTAGAATTAC AAAGAATTAA ACAATCTCAT
GATAAGAATT TAGTTGAACT TGATGATCTG ACTAATATCC TTGATTTAGA AAATATTCCA
AAGAGAATTG AATGTTATGA CATAAGCCAT ATCCAAGGAA GTGACGCTGT TGCATCACAA
GTAGTATTTA TTGATGGTAT TGCGGCAAGG CAACACTATA GAAGATATAA AATTAAAAGC
CCAAATATAA AAATTGGTCA CAGCGACGAT TTCGAATCAA TGGCTGAAGT GATAACTAGA
AGATTTAGAA GATGGGCTCG TTTTAAAGAA GAAGGTGGAG ATATTAATGC CCTACTAAGT
AATCAAAGCA GTGTTCTAGA TAACCTGAAT TTAAATGACT GGCCAGATCT CGTTGTGATA
GATGGAGGTA AAGGTCAATT AAGTTCTGTC GTAGCTGCTC TTGAGGAACT TAAACTTGAT
CAAAATTTAA ATGTTATATC TTTAGCAAAA AAGAAGGAGG AAGTTTTTAT TCCTAATGTT
AAACAATCAT TAGTTACCGA ATCAAATCAA CCAGGAATGC TTTTGCTAAG GAGACTGAGA
GATGAAGCTC ATAGATTTGC AATTACTTTT CATAGGCAAA AAAGGAGTCA ACGGATGAAA
CGTTCTCAGT TAAATGAAAT ACCGGGTCTT GGACCTCAAA GAATAAAATT ATTGCTTGAG
CATTTCAGGT CAATTGAGGC AATACAAATG GCTACTTTTT CTGAACTTTC ATCAACACCC
GGCTTAGGCA GATCAACTGC TGTTGTTATT AGAAACTATT TTCATCCCGA TAAAAATAAA
TAA
 
Protein sequence
MELIPLIRDK SRLSDFLKDI PNDPGCYLMK DGEDRLLYVG KSKKLRNRVR SYFRSGNELS 
PRISLMVRQV ADIELIVTDN ESEALTLESN LIKSHQPYFN VLLKDDKKYP YVCITWGDKY
PRIFLTRKRR QRQLKDKYYG PYVDVYLLRK TLFSIKKLFP LRQRRIPLYK DRTCLNYSIG
RCPGVCQEEI SSEDYKNTLK RVEMIFQGRT DELRILLEKQ MISFSESLKF EEAGSVRDQL
KGIDRLYESQ KMIIPDSSVC RDIIAMASEE NISSVQIFQM RSGKLIGRLG YFSDNSNFNS
SQILQQVIEN HYSNVDPVEI PSEILVQHQL VNNILISDWL SEIKKQKVNI NVPKRSRKAE
IIKLVEKNAN LELQRIKQSH DKNLVELDDL TNILDLENIP KRIECYDISH IQGSDAVASQ
VVFIDGIAAR QHYRRYKIKS PNIKIGHSDD FESMAEVITR RFRRWARFKE EGGDINALLS
NQSSVLDNLN LNDWPDLVVI DGGKGQLSSV VAALEELKLD QNLNVISLAK KKEEVFIPNV
KQSLVTESNQ PGMLLLRRLR DEAHRFAITF HRQKRSQRMK RSQLNEIPGL GPQRIKLLLE
HFRSIEAIQM ATFSELSSTP GLGRSTAVVI RNYFHPDKNK