Gene CA2559_04670 details

Gene Information

Locus tag: CA2559_04670
Symbol:
ID: 9296424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Croceibacter atlanticus HTCC2559
Kingdom: Bacteria
Replicon accession: NC_014230
Strand:
Start bp: 1057562
End bp: 1059358
Gene Length: 1797 bp
Protein Length: 598 aa
Translation table: 11
GC content: 35%
IMG OID:
Product: excinuclease ABC, C subunit
Protein accession: YP_003715698
Protein GI: 298207519
COG category:
COG ID:
TIGRFAM ID:
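
The gene and protein lengths listed above follow from the start and end coordinates. The short Python check below is only a sketch: it assumes the coordinates are 1-based and inclusive and that the last codon is a stop codon, which is consistent with the values in this record. The variable names are illustrative.

```python
# Coordinates copied from the record; assumed to be 1-based and inclusive.
start_bp = 1057562
end_bp = 1059358

# Inclusive coordinates span (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# One codon per amino acid, minus the terminal stop codon (assumption).
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1797 598
```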


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAGAC CACCTTTAGA CATACAAGTA AAGACTTTGC CAAGCAGTCC TGGCGTGTAC 
CAATATTTTG ATAAGGATGA TAAAATCTTA TATGTTGGAA AGGCTAAAAA CTTAAAAAAA
AGGGTGAATT CCTACTTTAC TAAAAAGCAC GACAGTCATC GTATTGGTGT GATGGTAAAG
AAGATTAAAA ACATAAAGCA TATAGTAGTT AATTCTGAAA CAGATGCACT TCTGCTAGAA
AACAACTTAA TTAAAAAACT GAAGCCTCGT TTTAATGTCA TGCTTCGTGA TGACAAAACC
TATCCTTGGC TTTGCATTAA AAAAGAACGC TTTCCTAGAG TATTTCCTAC GCGACGTGTT
ATTAAAGATG GCAGCGAATA TTATGGTCCT TTTACAAGTA TGAAAACGGT ACATACCTTG
CTTGACCTTA TTAAAGGCCT ATACCAACTT AGGACTTGTA AATATGATTT AGCTGAAGAA
AAGATAGATG CCGGTAAATA CAAAGTTTGC TTAGAGTATC ATTTAGGAAA CTGTCAAGGT
CCTTGTGAAG GTTACCAAAC AGAAGAGGAA TACCATAGCA ATATTGATCA TATAAGACAG
ATTGTAAAAG GTAATTTTAA AGATTCGCTC ATTAAATTTA AAGACCAGAT GAAAGGTCAC
GCAGAGCGTA TGGAGTTTGA AGATGCTCAG CGCATAAAAG AAAAGATACA AGTTTTAGAA
AAGTACCAAA GCAAATCTAC AGTAGTAAAT CCTAAAATTA ATAATGTTGA TGTATTTTCA
ATTGTTAGTG ATGAAGGCTT TGGGTACGTT AATTTTTTAC AGTTATCTCA CGGCTCAATA
ATAAGATCTC ACACTATTGA ACTTAAAAAG AAGTTAGATG AAGGTGACCA AGAGTTACTA
GAGCTTGGTA TTATTGAAAT ACGACAACGC TTTAACTCTC AATCTAAAGA AATTTATGTG
CCTATGCATG TTGAGGTAGG TGAAGATTTA AAGGTGACCG TTCCTAAGCT AGGAGACAAG
AAAAGTATTG TAGACCTTTC TACCAGAAAC GCTAAATATT TTAGGCAAGA GCGTTTTAAA
CAAATGAAAA TTGTAGACCC AGATAGGCAC GTTAAGCGTA TTATGGCGCA AATGAAAGAA
GATTTACGAC TTAATGAAGA ACCAAGACAT ATTGAATGTT TTGACAACTC TAACATTCAA
GGTACAAATC CTGTAGCAGC ATGTGTGGTG TTTAAAGATG GTAAACCTAG CAAAAGCGAG
TATAGAAAAT TTAATATTAA AACTGTTGAA GGTCCAGATG ACTTTGCCAG TATGGAGGAA
GTGGTTTTTA GACGTTATAG GCGTCTTTTA AATGAAGACC AACCCTTACC GCAATTAATT
ATAATTGATG GTGGTAAAGG CCAACTATCT TCTGCCGTAA AAGCATTAGA CGACTTAGGT
TTACGTGGAA AGATTGCTAT TGTAGGTATT GCTAAACGCC TAGAGGAAAT ATTTTACCCA
ACAGATAAAT ACCCTTTGTA TCTAGACAAA AAATCTGAGT CTTTAAAAAT AATACAACAC
CTTAGGAATG AAGCACACCG CTTCGGTATT ACGTTTCACA GACAAAAACG AAGTAGCGCT
GCTTTAGGCA CTGAACTTGA AAATATAACT GGAATAGGTG AAAAAACTGC AGTCGAGTTA
TTAAAACACT TTAGAAGTAT TTCAAAAATT AAATCGGCTA AAAGAGAAGA CCTTGAACAG
GTTGTTGGTG TAGCAAAGGC AAGCTTGGTT TATGAATTTT ACAATAAAGA AAACTAG
 
Protein sequence
MERPPLDIQV KTLPSSPGVY QYFDKDDKIL YVGKAKNLKK RVNSYFTKKH DSHRIGVMVK 
KIKNIKHIVV NSETDALLLE NNLIKKLKPR FNVMLRDDKT YPWLCIKKER FPRVFPTRRV
IKDGSEYYGP FTSMKTVHTL LDLIKGLYQL RTCKYDLAEE KIDAGKYKVC LEYHLGNCQG
PCEGYQTEEE YHSNIDHIRQ IVKGNFKDSL IKFKDQMKGH AERMEFEDAQ RIKEKIQVLE
KYQSKSTVVN PKINNVDVFS IVSDEGFGYV NFLQLSHGSI IRSHTIELKK KLDEGDQELL
ELGIIEIRQR FNSQSKEIYV PMHVEVGEDL KVTVPKLGDK KSIVDLSTRN AKYFRQERFK
QMKIVDPDRH VKRIMAQMKE DLRLNEEPRH IECFDNSNIQ GTNPVAACVV FKDGKPSKSE
YRKFNIKTVE GPDDFASMEE VVFRRYRRLL NEDQPLPQLI IIDGGKGQLS SAVKALDDLG
LRGKIAIVGI AKRLEEIFYP TDKYPLYLDK KSESLKIIQH LRNEAHRFGI TFHRQKRSSA
ALGTELENIT GIGEKTAVEL LKHFRSISKI KSAKREDLEQ VVGVAKASLV YEFYNKEN
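
As a rough illustration of how the gene sequence relates to the protein sequence and to the GC content figure, the Python sketch below strips the 10-base block spacing, computes a GC fraction, and translates codons until the first stop. The helper names (`clean`, `gc_content`, `translate`) are hypothetical and are not part of any database tool. The codon assignments are the standard ones, which translation table 11 shares for elongation; the alternative start codons allowed by table 11 are not handled here, and this gene starts with ATG in any case.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed above.
fragment = "ATGGAAAGAC CACCTTTAGA CATACAAGTA AAGACTTTGC CAAGCAGTCC TGGCGTGTAC"
print(translate(fragment))  # MERPPLDIQVKTLPSSPGVY
```

Passing the full gene sequence to `translate` and `gc_content` in the same way gives a quick consistency check against the protein sequence and the GC content reported in the record.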