Gene P9303_14941 details

Gene Information

Locus tag: P9303_14941
Symbol: uvrC
ID: 4777934
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1297351
End bp: 1299354
Gene Length: 2004 bp
Protein Length: 667 aa
Translation table: 11
GC content: 45%
IMG OID: 640087004
Product: excinuclease ABC subunit C
Protein accession: YP_001017505
Protein GI: 124023198
COG category: [L] Replication, recombination and repair
COG ID: [COG0322] Nuclease subunit of the excinuclease complex
TIGRFAM ID: [TIGR00194] excinuclease ABC, C subunit
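
The coordinate and length fields above are consistent with one another, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1297351
END_BP = 1299354
GENE_LENGTH_BP = 2004
PROTEIN_LENGTH_AA = 667


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a whole number of codons")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```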


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.326891
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCTTAA CCTCGCTTCA ACGGGTTGGT CATACGGCGG TTGAAAATAA TTTGACCAAC 
GCTTCTCAGC GCACCATCCT TCTAAAGGAT GAAAAAAGAC TTGAGCAAAG GCTTAAGGAG
ATTCCAGCTG AACCTGGTTG TTATCTCATG CGTGATGGAG AAGACAGGTT GTTATATGTG
GGTAAATCAA AATGTTTACG AAATCGAGTA AGAAGTTATT TCCGAAGTAG CAGTGATCAT
GGGCCTCGTA TTCGATTGAT GGTGCGACAA ATTGTTGAGA TTGAATTTAT CGTAACTGAT
AGCGAGGCTG AATCCTTAGT TCTGGAATCT AACCTGATAA AGAATCAACA GCCACATTTC
AATGTGTTGT TAAAAGATGA CAAAAAATAT CCCTATATTT GCATTACATG GAGCGAGGAG
TATCCACGTA TCTTTATCAC AAGGAGGCGA CGTTTCCGAA ATAAGAATGA TCGCTTCTAT
GGCCCATATG TAGATGTAGG ACTATTGCGT AGAACCTTAT TTCTTGTCAA ACGTTCGTTC
CCGCTGAGAC AACGACCACG GCCATTGCAC CAGGATCGTA CCTGCCTTAA CTATTCAATT
GGACGTTGCC CTGGCGTATG CCAGCAGAAG ATCACACCAA AGGATTATCA TCAAGTTCTT
CGCAAAGTTG CGATGGTTTT TCAGGGCAGG AATCAAGAGC TTAAGGTTTT GCTTGAGAGG
CAAATGGAAC GTTATTCAGA TCGATTGGAC TATGAGTCAG CCGCTAACAT AAGGGATCAG
ATTAAAGGGT TAGAACAACT TACAGAAGAA CAGAAGATGA GCTTACCAGA TTCAAGCGTA
AGCCGAGATG TACTTGCTAT TGCAAGCGAT CATAGGGTTG CGGCAGTTCA GCTTTTTCAG
ATGCGTGCAG GTAAACTTGT TAATCGTCTT GGCTTTACTG CTGATGCAGT CGATCAAACT
CTTGGAAGTG TACTTCAGCG AGTTATCGAA GAACATTACA GTCAAGTAGA TGCTGTAGAA
GTCCCGCCAG AAGTTCTTGT TCAATACTCA TTACCTCAGC AAGAATTACT AATTGATTGG
CTCAGTGAGC AACGCGGTCG GCGTGTTCAA ATCAGTTGTC CTCAACGCCA AGCAAAAGCA
GAGTTGATAG AACTTGTTGA ACGCAATGCA GTATTTGAAT TATCTCGTGC GAAGAGTGGG
CAGCAACAAC AGGAATTGGC AACAGAGGAT CTTGCCCAGT TACTTGAACT CACGACACTG
CCAAGAAGAA TCGAAGGATA TGATATTAGT CATATCCAAG GAAGTGATGC TGTTGCATCA
CAGGTTGTAT TTATCGATGG GCTACCCGCT AAACAGCATT ACCGAAAGTA TAAAATTAAA
AGTAGTAGTA TCAAATCTGG CCATAGCGAT GACTTCATGG CGATGGCTGA GATTATGCGC
CGTCGCTTTC GTCGTTGGGC AAGGGTGAAG CAAGAAGGAT CAGATTTTGA AAAACTGCAG
CGCTGCAGTG GCAGCACCTT GCAGACAGAT GGTCTCAATG ATTGGCCTGA TGTGGTGATG
ATTGATGGCG GTAAGGGTCA GCTTTCATCA GTCATGGAAG CATTACGCGA GCTTGACCTT
CATGAGGATC TCGTCGTTTG TTCATTGGCT AAGAAACATG AACAGATCTT TGTACCTGGA
CAAAGCAAAC CATTGGATTC TGATCCTGAT CAGTTGGGTG TCGTCCTTTT AAGGCGGCTT
CGGGATGAAG CACATCGCTT TGCCGTGAGT TATCACCGCC AACAAAGAGG GGTAAGAATG
AACAGATCTC GGCTTACAGA CATCCCTGGA CTTGGGCCTA GGCGTGTTCG TGACCTTCTT
GCACACTTTC AATCCATTGA TGCTATTCAG TTGGCTAGCG TCCAGCAAAT TAGTCAGGCA
CCAGGGCTTG GACCAGCACT TGCATTTCAG GTTTGGACCT ATTTCCACCC TGAAGCCGAT
AAGGCGTTGG AGGAGGTTGC CTGA
 
Protein sequence
MRLTSLQRVG HTAVENNLTN ASQRTILLKD EKRLEQRLKE IPAEPGCYLM RDGEDRLLYV 
GKSKCLRNRV RSYFRSSSDH GPRIRLMVRQ IVEIEFIVTD SEAESLVLES NLIKNQQPHF
NVLLKDDKKY PYICITWSEE YPRIFITRRR RFRNKNDRFY GPYVDVGLLR RTLFLVKRSF
PLRQRPRPLH QDRTCLNYSI GRCPGVCQQK ITPKDYHQVL RKVAMVFQGR NQELKVLLER
QMERYSDRLD YESAANIRDQ IKGLEQLTEE QKMSLPDSSV SRDVLAIASD HRVAAVQLFQ
MRAGKLVNRL GFTADAVDQT LGSVLQRVIE EHYSQVDAVE VPPEVLVQYS LPQQELLIDW
LSEQRGRRVQ ISCPQRQAKA ELIELVERNA VFELSRAKSG QQQQELATED LAQLLELTTL
PRRIEGYDIS HIQGSDAVAS QVVFIDGLPA KQHYRKYKIK SSSIKSGHSD DFMAMAEIMR
RRFRRWARVK QEGSDFEKLQ RCSGSTLQTD GLNDWPDVVM IDGGKGQLSS VMEALRELDL
HEDLVVCSLA KKHEQIFVPG QSKPLDSDPD QLGVVLLRRL RDEAHRFAVS YHRQQRGVRM
NRSRLTDIPG LGPRRVRDLL AHFQSIDAIQ LASVQQISQA PGLGPALAFQ VWTYFHPEAD
KALEEVA
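
Both sequences are printed in blocks of 10 characters, 6 blocks per line, so the spaces and line breaks have to be removed before the sequences are used elsewhere. A minimal sketch of that clean-up follows, using only the first and last lines of the gene sequence as input; the helper name join_blocks is hypothetical, and the line counts (33 full lines plus one partial line) are read off the listing above.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to print a sequence in blocks."""
    return "".join(text.split()).upper()


# First and last lines of the gene sequence, copied from the listing.
first_line = "ATGCGCTTAA CCTCGCTTCA ACGGGTTGGT CATACGGCGG TTGAAAATAA TTTGACCAAC"
last_line = "AAGGCGTTGG AGGAGGTTGC CTGA"

head = join_blocks(first_line)
tail = join_blocks(last_line)

print(len(head), head[:3])   # 60 ATG
print(len(tail), tail[-3:])  # 24 TGA

# 33 full lines of 60 bases plus the final partial line.
print(33 * len(head) + len(tail))  # 2004
```

The total agrees with the Gene Length field, and the sequence begins with ATG and ends with TGA, a stop codon in translation table 11.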