Gene Cphamn1_2421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_2421 
Symbol 
ID6376116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp2579584 
End bp2582703 
Gene Length3120 bp 
Protein Length1039 aa 
Translation table11 
GC content53% 
IMG OID642684899 
Producttype III restriction protein res subunit 
Protein accessionYP_001960797 
Protein GI189501327 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACCG ACAATCCCAT ACTCAACAGT CCCTACGAAG AACCCCAGCG GCATTATGCA 
ACAGATGCCG ACGGTTCCTT GAATTACAGC GATATCAGGG ATGGGCGTCG GATATTCGTA
CCGGAATTGC AGCCTATCCC AGTTAAGCAG CCCCAAGGAT CATTGCTTGA GATCAACGAT
TTCGCGGCAG AATTTGACAG TCATCTGGTC AATCTGCTGC GCTGTGAGAT TGCGCTTTGG
AGAGTTGCGG AATATCCCGG AACAACAAGA GTTACAAAAG AACTCCTGAT ATTCTGGTTC
AACAACCCTG GTCGTCATGC TGTCCGCAAG CTATTTTTCG CTCAGCGTGA AGCAGTAGAG
ACTGCTGTCT GGCTGAACGA GGTGGCGGAA AAAAGCAACT CTGGTCAGAA CATCATGAGC
ATTCTGCAGA CCGCTCATCG TTCTGTCAGT GCTGAAAAGG AACAGCAGTT GCCACGCCTC
GCATTCAAGA TGGCGACAGG AACCGGAAAG ACCGTTGTCA TGGGGATGCT GATGCTCTAC
CATCTCTTCA ACAGGCGCGA ATACCGACAG GATACACGGT TTGCTGATTA TTTCCTGATT
ATCACCCCAG GCATTACCAT TCGTGAGCGT CTTGGCGTGT TGTTCGTTGA TAAGCACAGT
GCAAGTCGTC AGGAGCGCAC CGATTATTAC GCACTTCGGG ATCTTGTACC CCGGCAATTC
GAGCTCGCTC TCGATGGGCT CAATGCACGC CTGGTGATCA CGAACTACCA TGCCCTGGAG
CCTAAAACCC TTCAGGGCAA CAAGAAAAGC CCGTTCGACG GCAAGCTCGA TGCCGCGGGA
AAGAAGCAGG AAGCCAAAGA GGATTTTGGC AGGCTGATCA ATCGCTTGCT CGGAAGCTTC
AGGAAAGGGA GCCGCCTGCT CGTGCTCAAC GACGAGGCCC ACCACTGTTA CCTGCCAAAA
TCGTCGGGCA GGACAAAGGA TAACGAAGAA TCGGACGAGA ACGCAAAAGC AGCTATCTGG
TTTTCCGGAC TCGTGGAAAT AGCCAAACGC TTCAAGCTGC AACAGGTGTA TGACTTGTCG
GCCACCCCCT ACTACCTGCA GGGATCCGGT TACAAGCCCT ACACGCTGTT TCCCTGGGTG
GTAAGCGACT TCGGGTTGAT CGAAGCTATC GAAGCGGGGC TGGTGAAAAT CCCGTTCATG
CCGCAAAGCG ACAATACACA GGAGCTGGAT ATGCCGGTAT TGCGCAACCT GTATGAGCAT
GTGAGCGATG AACTCCCCAA AAAAGGGCGC AAGAAAAAGA AATCCGAGGC GAAAAAAGAG
GGCGTGTCGA TCACCGAAGA GCCGCCTGTT CTTCCGAAAC TGGTCAAGGG AGCGCTCGAC
CAGTTCTACA ATCACTACCG GGAGTATTGC GACGGCCTGC GCCAGCAGTT CGAGGAAACG
GCCGGATTGT TCACCTCTCC GCCGGTCTTT ATCGTTGTCT GCAACAATAC GTCAGTCTCC
AAGGAGGTTT ACAAGTTCAT CGCAGGGTAT GAATATGAAC GGACGGGCAA GAATGGTAAC
ACGGTCCGCG AAATTGTCGA TGGACACTAT TCGCAGTTCT CCAACTACGA TGCATCGACC
AGGCAACCCA GGCACCGTCC GCCGACGCTC CTTATCGACA GCGACGCGCT TGAAAACTCC
GACCAGATCA ACGACGAGTT CAAAAAAATC TTCGCATCCG AAATTGCAGA GTTCAAACGT
GATTACGCGC GCCTGAAAGG CCAGGGTGCT GCGGAACAGA TCACCGATGC CGAGATTCTC
CGGGAAGTGG TCAACACCGT CGGGCAGCCG GGCAAGCTGG GCGCCCACAT CCGCTGTGTT
GTCTCGGTCT CGATGCTGAC CGAAGGGTGG GACGCCAACA CGGTTACCCA TATCATGGGA
CTTCGCAAGT TCGGCTCCCA GCTTCTCTGC GAACAGGTTG CCGGCCGGGC TCTGCGGAGG
ATGAATTACT ACCTGCAGAC CTACAGGAAA GACACCGGCG ACATTGTTCC CGAAACGGAA
CGTCACCGCT TCAAGCAGGA AAACCTGGTC GAGAAGTTTC CGCCTGAATA CGCGCACATC
ATCGGGGTGC CGTTCAGCAT GTTCAAATCC GGATCGACCA CCCTCACTCC TCCTCCGGAC
TACACCCATG TTACGGCTCT GCCTGAACGC CATCAGGAGC TTGAAATCAC GTTCCCCAAC
GTCGTCGGCT ACCGGACGGA ATATCTCGAC AAAGGGATCG TTCACGATTT CAGCGGTATC
GAGAACTATG AACTGGATTT TTCGAAGTTT CCTACCGAGA TCGTGATGGC ATGTCCGTTC
TCCCCGCATC AGGAAACCAT GCAGGTGACA TCCGTTCTGG AGAGACGGGA CCAGGAACTG
CTCTACCTGA TCACGAAGGA GCTGATCCGC TACCATTTTG CCGACGACGA CCAGAATCCT
CGTTTTCAGC TCTTCGGCGA TCTGAAAAAT ATCGTCGAGG AGTGGTACGA CACAAAGATT
GTGCTTCTGA ACCAGTCGGA TGAACGATAC CGGCGACTGC TCTACTTCGA GAACGGCAAA
ACCATCGCCG ACCATATTGC ACGGGGTATC AACCCGCACA TCAACACGGA AGAATATATC
CGGCCGGTCT TCAACTACTA CAATCGCTTT GGTAGTACGA AATACGTCAG TGGCAATACC
ACGAAAGAAA CCTGGCCGAC GTCGAAAAGC CATGTCAATG CAGTTGTCAT GGACAGTGAC
TGGGAGGCCA TTGCAGCCAA GACACTGGAG GAGATTCCCG AGGTCGTTTC CTATGTCAAG
AACCAGTTTC TCGGTTTCAC GATCCCGTAT GTGAAGGATG GCAAGGACAA GCTCTACTAT
CCTGATTTCC TTGTCCGTCA CGTAACTCCA ACCAGAGAAA CCGCCAACCT GATCATCGAG
ATCAGCGGCA TGAGCAAGGA CAAGGCCGAA AAGAAATGGT TCGTGCACAA CCGCTGGCTG
CCGGCCGTGA ATGCCGTGCA GGAAAAATAC GGACTCGGCC GCTGGCACTT CATCGAGATC
GCCAACGATA TCCGCGACAT CAGGACCCAG TTGGCGGAAA ATATTAAAAT CAACCTATAA
 
Protein sequence
MTTDNPILNS PYEEPQRHYA TDADGSLNYS DIRDGRRIFV PELQPIPVKQ PQGSLLEIND 
FAAEFDSHLV NLLRCEIALW RVAEYPGTTR VTKELLIFWF NNPGRHAVRK LFFAQREAVE
TAVWLNEVAE KSNSGQNIMS ILQTAHRSVS AEKEQQLPRL AFKMATGTGK TVVMGMLMLY
HLFNRREYRQ DTRFADYFLI ITPGITIRER LGVLFVDKHS ASRQERTDYY ALRDLVPRQF
ELALDGLNAR LVITNYHALE PKTLQGNKKS PFDGKLDAAG KKQEAKEDFG RLINRLLGSF
RKGSRLLVLN DEAHHCYLPK SSGRTKDNEE SDENAKAAIW FSGLVEIAKR FKLQQVYDLS
ATPYYLQGSG YKPYTLFPWV VSDFGLIEAI EAGLVKIPFM PQSDNTQELD MPVLRNLYEH
VSDELPKKGR KKKKSEAKKE GVSITEEPPV LPKLVKGALD QFYNHYREYC DGLRQQFEET
AGLFTSPPVF IVVCNNTSVS KEVYKFIAGY EYERTGKNGN TVREIVDGHY SQFSNYDAST
RQPRHRPPTL LIDSDALENS DQINDEFKKI FASEIAEFKR DYARLKGQGA AEQITDAEIL
REVVNTVGQP GKLGAHIRCV VSVSMLTEGW DANTVTHIMG LRKFGSQLLC EQVAGRALRR
MNYYLQTYRK DTGDIVPETE RHRFKQENLV EKFPPEYAHI IGVPFSMFKS GSTTLTPPPD
YTHVTALPER HQELEITFPN VVGYRTEYLD KGIVHDFSGI ENYELDFSKF PTEIVMACPF
SPHQETMQVT SVLERRDQEL LYLITKELIR YHFADDDQNP RFQLFGDLKN IVEEWYDTKI
VLLNQSDERY RRLLYFENGK TIADHIARGI NPHINTEEYI RPVFNYYNRF GSTKYVSGNT
TKETWPTSKS HVNAVVMDSD WEAIAAKTLE EIPEVVSYVK NQFLGFTIPY VKDGKDKLYY
PDFLVRHVTP TRETANLIIE ISGMSKDKAE KKWFVHNRWL PAVNAVQEKY GLGRWHFIEI
ANDIRDIRTQ LAENIKINL