Gene Cpha266_0017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0017 
Symbol 
ID4568905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp12686 
End bp14092 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content56% 
IMG OID639764621 
Producttransposase, IS4 family protein 
Protein accessionYP_910514 
Protein GI119355870 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACTCC CTTACACCAA TATACACACT GACCAACCCA ACCAACAAGC GCTATTTCCT 
GATTGTTTCG AAGTATCCGT TGCCCCTGTC AAGGGCAAAA AAGTGGTTCT TGATTTCCAG
GGAGGCAACG CCACCAGCGA TGCCGGTGTC CTGCTGTTGA AGGAAGTCGA GTCCATGACC
AGGATCGTTC CGAAGCTTGC CGATTGCATT GCTGATTCCC GTCGGACCTC ATCTGTCATG
CATTCGATCC CTGACCTGAT CGCCCAGCGG GTCTACCAGA TCGCCTGCGG CTATGAGGAC
GGCAACGACA GCAATTCCAT GCGGAAGGAT CCCGCTCTCA AGATGGCCCT CAACCGTCTT
CCTGAAAGCG GCGATGACCT TGCCAGCCAG CCTACCTTCA GCAGGCTGGA GAACATGGTT
ACCCGTCCGG AGCTCTATCG TATGGCTGTC GGGTTCCTCG ATCATTTCCT TGACTCCTAC
ACCGAGGCAC CGCGGGTCAT CGTCCTGGAC TTTGACGATA CCGAGGATGT TGTTCATGGC
AAACAGCAGC TGGCGCTTTT CAACGGCTAT CACCAGGAGA CCTGTTACCA GCCTCTCCAT
GTTTTCGAGG GGTTGACTGG CAAGCTGATC GCCTCGATCC TTCGCCCCGG CAGGCGTCCT
ACCGGCAAGG AGATTGTATC ATACGTGAAG CGCATTGTCC GCCATATCCG GAGCAGGTGG
CCGGAAACAA TCATCGTCTA CCGCGGCGAC AGCCATTACG GTGTGCCGGA AGTCTACTCC
TTCCTTGCCA GAGAGCAGAA CTGCTACAGC GTGACCGGCC TCGGCGGTAA TGACGTGCTG
CTTCGCTCCG TCAAGGACAT TATTGAGGAG GTCAAGAAGC ATGGAGCCGG ATACCGCCGT
TACCATACCT TTCAGTATCA GGCACGGAGC TGGAAGGAGA CCCGCAGAGT GGTCGCCAAG
GTCGAGATGA CCGAAAAGGG GCTGAACGTG CGCTTCATCA GCACCGACAT GCAGGAGGCA
AAGGCCAAGA CTCTGTACGA GCAGATTTAC AGTGCACGTG GCAACGATGA ACTCTACATC
AAGGCGCATA AAACGTTCAT GAAAAGCGAC CGGACCTCAT GCCATCGCTT TCTTGCCAAC
CAGTTCAGGG TCTTCCTGCA TTCGGCGGCC TATGTCCTGG TCCACGCCTT CCAGACCAAC
CTGCTCCGGG GCACCGCCCT TGCCACGGCG ACCTTCGAGA CAATCCGATT GAAGCTCCTG
AAAATCGGAG CGAAGGTCAT TGAGATGAAG ACACGCATCA AGGTGCATCT GCCGACCTCA
TATCCGTACA AACCGATACT GAACAAGTGC TTCGCCGTCC TTGAGCACCT GCGATCAGTC
CCATGGCCAT CAACAGCAAT TCCGTAA
 
Protein sequence
MQLPYTNIHT DQPNQQALFP DCFEVSVAPV KGKKVVLDFQ GGNATSDAGV LLLKEVESMT 
RIVPKLADCI ADSRRTSSVM HSIPDLIAQR VYQIACGYED GNDSNSMRKD PALKMALNRL
PESGDDLASQ PTFSRLENMV TRPELYRMAV GFLDHFLDSY TEAPRVIVLD FDDTEDVVHG
KQQLALFNGY HQETCYQPLH VFEGLTGKLI ASILRPGRRP TGKEIVSYVK RIVRHIRSRW
PETIIVYRGD SHYGVPEVYS FLAREQNCYS VTGLGGNDVL LRSVKDIIEE VKKHGAGYRR
YHTFQYQARS WKETRRVVAK VEMTEKGLNV RFISTDMQEA KAKTLYEQIY SARGNDELYI
KAHKTFMKSD RTSCHRFLAN QFRVFLHSAA YVLVHAFQTN LLRGTALATA TFETIRLKLL
KIGAKVIEMK TRIKVHLPTS YPYKPILNKC FAVLEHLRSV PWPSTAIP