Gene Cphamn1_0211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_0211 
Symbol 
ID6373866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp206045 
End bp207199 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content51% 
IMG OID642682729 
Productintegrase family protein 
Protein accessionYP_001958665 
Protein GI189499195 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000967186 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACGTTT CATCCGTACA CTTGCGGAAG CGGAAGCAGG GGAAGAGCGG GCGCATAAGC 
CTGTATCTTG AGTTTTACAA GGGCGCGGTT ACACAGCCTG ACGGGAAAGC GAAGGTTCTC
AGGGATTACG AATACCTGAA CCTGTATCTT GACGACAAGC CGAGAACAGC GGCGGAAAAA
GAGCATAACA AGAATATACT TGAGCTTGCC AAGTCGATCA AGGCAAAGCG GGAACTTGAG
ATCAAGAACG GGCAGTACGG GTTTGACTCC TGCGTCAAGG CAAAAGCCCT GTTTCTCTCT
TACTTCAAGG CTGAAGCGGA AAAGAAATCG AAGCCGGGCT ACCCCGGTAA TTGGGGCAGT
ACTCTCAAGC ACCTCACCAG ATTTGTCGAG AAGCACCGTT CTGTTCGGGT CACCTTCCGG
GAGATCGACA AGGCGTTTTG CGAAGGGTTC AAGGACTATC TCCGGGATGA GGCGACAACG
AGAACGGGGA AAGGGCTCTC TTCAGCGTCT CAGGGTGCTT ATTACGGGAA GTTCAAGGCT
TGCTTGAATA AGGCTATAAA GGACGGAATT CTGTCCGTTG ATCCTGCAAA GGGCGTGGCG
CGCCCGAAGA TCGTTTCACA CAAGCGGGAA TATCTGACAT TCGACGAACT TCAGGCAATG
GCTAAGGCTG AGTGCCGGAA CCCTACGCTG AAGCGGATGT TCCTGTTCTC TTGCTTGACC
GGGTTACGTT TTTCTGATTG TCATAAACTG ATATGGGGTG AAGTGGAACA GTACGGCGAC
GGGTGGCGGA TCGTATTTCA CCAGCAGAAG ACGAAAGGAC TTCAGTATCA CGACATTTCA
CAGCAGGCGC GGGAGCTGAT GGGGGAACAG GGCGCGGCTG ATGACCGGGT GTTCTTCGCC
ATAAGCAAGT ATTCGGCGTA TCTCAGTATC GTTCTCCGGG AATGGGTTTT GAAGGCAGGC
ATAACAAAAC ACCTGACGTT TCATTCAGGC CGTCACACCT TCGCAGTGTT ACAACTGGAG
AATGGGACAG ACATTTACAC ACTCAGCAAG CTATTGGGAC ATAGAGAGAT CGAGGTAACG
GCTATTTATG CCGATATTCT GGATAAGAAG CGGCGTGAGG CGATGACTGA GCGGATTCCT
GAACTGAGTT TATGA
 
Protein sequence
MDVSSVHLRK RKQGKSGRIS LYLEFYKGAV TQPDGKAKVL RDYEYLNLYL DDKPRTAAEK 
EHNKNILELA KSIKAKRELE IKNGQYGFDS CVKAKALFLS YFKAEAEKKS KPGYPGNWGS
TLKHLTRFVE KHRSVRVTFR EIDKAFCEGF KDYLRDEATT RTGKGLSSAS QGAYYGKFKA
CLNKAIKDGI LSVDPAKGVA RPKIVSHKRE YLTFDELQAM AKAECRNPTL KRMFLFSCLT
GLRFSDCHKL IWGEVEQYGD GWRIVFHQQK TKGLQYHDIS QQARELMGEQ GAADDRVFFA
ISKYSAYLSI VLREWVLKAG ITKHLTFHSG RHTFAVLQLE NGTDIYTLSK LLGHREIEVT
AIYADILDKK RREAMTERIP ELSL