Gene Cphamn1_2157 details

Gene Information

Locus tag: Cphamn1_2157
Symbol:
ID: 6375851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 2333372
End bp: 2334778
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 56%
IMG OID: 642684644
Product: transposase IS4 family protein
Protein accession: YP_001960543
Protein GI: 189501073
COG category:
COG ID:
TIGRFAM ID:
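
The coordinate and length fields above are related by simple arithmetic. The following minimal Python sketch checks that relationship. It assumes the coordinates are 1-based and inclusive and that the gene ends in a single stop codon; neither assumption is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 2333372
END_BP = 2334778
GENE_LENGTH_BP = 1407
PROTEIN_LENGTH_AA = 468


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1407
print(coding_codons(GENE_LENGTH_BP))   # 468
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields reported in the record.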


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.360609
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACTCC CTTTCGCTAA TATACACACT GAACAGCCCA CCCAACAAGC GCTATTTCCT 
GATTGTTTCG AAGTATCCGT TGCCCCGGTC AAGGGCAAAA AAGTGGTTCT TGATTTCCAG
GGCGGCAACA CCACCAGCGA TGCCGGTGTC CTGCTGTTGA AGGAAGTCGA GTCCATGACC
AGGATCGTTC CGAAGCTTGC CGATTGCATT GCCGATTCGC GTCGGACCTC GTCTGTCATG
CATGTGATCC ATGACCTGAT CGCCCAGCGG GTCTACCAGA TCGCCTGCGG CTATGAGGAC
GGCAACGACA GCAATTCCCT GCGGAAGGAT CCCGCTCTCA AGATGGCCCT CAACCGTCTT
CCTGAAAGCG GCGATGACCT TGCCAGCCAG CCGACCTTCA GCAGACTCGA AAACATGGTT
ACCCGTCCGG AGCTCTATCG TATGGCTGTC GGGTTCCTCG ATCATTTCCT TGACTCCTAC
ACCGAGGCGC CGCGGGTCAT CGTCCTGGAC TTTGACGATA CCGAGGATGT CGTTCACGGC
AAACAGCAGC TGGCGCTTTT CAACGGCTAT CACCAGGAGA CCTGCTACCA GCCTCTCCAT
GTTTTCGAGG GGTTGACTGG CAAGCTGATC GCCTCGATCC TTCGCCCTGG CAGGCGTCCT
ACCGGCAAGG AGATTGTGTC ATACGTGAAG CGTATTATCC GCCATATCCG GAGCCGGTGG
CCGGAAACGA TCATCGTCTA CCGCGGCGAC AGCCATTACG GCGTGCCAGA AGTCTACTCC
TTCCTTGCTT CAAAGCGGAA CTGCTACAGC GTGACCGGCC TCGGCGGTAA TGACGTGCTG
CTCCGCTCCG TCAAGGACAT TATTGAGGAG GTCAAGAAGC ATGGAGCCGG ATACCGCCGT
TACCATACCT TCCAGTATCA GGCACGGAGC TGGAATGGGA GCCGCAGAGT GGTCGCCAAG
GTCGAGATGA CCGAAAAGGG GTTGAACGTG CGCTTCATCA GCACCGACAT ACAGGAGGCA
AAGGCCAAGA CTCTGTACGA GCAAATTTAC AGCGCTCGTG GCAACGATGA ACTCTACATC
AAGGCGCATA AAACGTTCAT GAAGAGCGAC CGGACCTCGT GCCATCGCTT TCTTGCCAAC
CAGTTCAGGG TCTTCCTGCA TTCGGCGGCC TATGTCCTGG TCCACGCCTT CCAGACCAAC
CTGCTCCGGG GCACCGCCCT TGCCACGGCG ACTTTCGAAA CGATCCGCTT GAAGCTGCTG
AAAATCGGGG CGAAAGTCAT CGAGATGAAG ACACGCATCA AGGTGCATCT GCCGACCTCA
TATCCGTACA AACCGATACT GAACAAGTGC CTCACCATCC TTGAGCACCT GCGATCAGTC
CCATGGCCAT CAACAGCAAT TCCGTAA
 
Protein sequence
MQLPFANIHT EQPTQQALFP DCFEVSVAPV KGKKVVLDFQ GGNTTSDAGV LLLKEVESMT 
RIVPKLADCI ADSRRTSSVM HVIHDLIAQR VYQIACGYED GNDSNSLRKD PALKMALNRL
PESGDDLASQ PTFSRLENMV TRPELYRMAV GFLDHFLDSY TEAPRVIVLD FDDTEDVVHG
KQQLALFNGY HQETCYQPLH VFEGLTGKLI ASILRPGRRP TGKEIVSYVK RIIRHIRSRW
PETIIVYRGD SHYGVPEVYS FLASKRNCYS VTGLGGNDVL LRSVKDIIEE VKKHGAGYRR
YHTFQYQARS WNGSRRVVAK VEMTEKGLNV RFISTDIQEA KAKTLYEQIY SARGNDELYI
KAHKTFMKSD RTSCHRFLAN QFRVFLHSAA YVLVHAFQTN LLRGTALATA TFETIRLKLL
KIGAKVIEMK TRIKVHLPTS YPYKPILNKC LTILEHLRSV PWPSTAIP
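
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is one way to do that in plain Python, not the database's own method. It assumes the sense-codon assignments of translation table 11 match the standard genetic code (the tables differ in permitted start codons), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Standard codon table built from the conventional TCAG ordering.
# Assumption: table 11 uses the same sense-codon assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCAACTCC CTTTCGCTAA TATACACACT GAACAGCCCA CCCAACAAGC GCTATTTCCT"
last_line = "CCATGGCCAT CAACAGCAAT TCCGTAA"

print(translate(first_line))  # MQLPFANIHTEQPTQQALFP
print(translate(last_line))   # PWPSTAIP*
```

The two translated fragments match the start and end of the protein sequence in the record. Passing the full gene sequence to `gc_percent` would allow a comparison with the reported GC content field.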