Gene Cphamn1_0043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_0043 
Symbol 
ID6373686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp46330 
End bp47856 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content50% 
IMG OID642682565 
Producttransposase IS4 family protein 
Protein accessionYP_001958513 
Protein GI189499043 
COG category[L] Replication, recombination and repair 
COG ID[COG5421] Transposase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGGGG ACTTGAAAAT CAGAAAGGTG CGAACTGCAT CCGGAGCAAC TGCCGTACAA 
GTAGTCCAGA ATAAAGGGAC GGGGCGTTCT TTCCTCAAAC ATATTGGCAG TGCTCATGAT
GAGCACGAGC TGGAATTGTT GCTGGATGAG GCAAGAAAAT TCGTCGAAGC TCATTGTCGC
CAACCAAGCC TTTTTTCGGA TACTGATACG CCGTCTACAC CATCGCTGTT TAAGACTGTT
CTTGACAAGT CAAGCGCTGT TGGCGTCACT CATCAGTTTG CCCGCAATGC GCTCCTTGCC
TGTGCCCGAA AATGTGGCCT GGGCTCGTTG CCGGAGCTGT ATCTTGATCT GGCACTCATG
CGCATCATTG AGCCAACATC GAAGTTGCGT TCGCTGGATC TTCTTGAGTC GTATTTCCAT
GTCAGCTATG CGAAACGAAC ACTCTATCGC CTGTTCCCCA AGCTCTTGGG GTATCAGGAG
GAGATCGAAA CCGCAGCCAT CCAAACCGCT CGAAGAGAAT TGCAGGAGCA GTTCAGTCTG
GTACTGTACG ATGTTACCAC GCTGTACTTC GAGTCCTTCA AGGAGTACGA TTTCCAGCGT
CCCGGATTCT CAAAGGACAA CAAACCCCAG CAGCCGCAAA TCGTCATTGG CTTGATCACC
ACCCGCTCAG ATTTTCCTGT CATGCATGAG GCGTTTGAAG GCAACACCTT CGAAGGGAAA
ACAATGCTCA AGGTCATTCA CCGCTTTCAG GAGCGTGTTG GCGAAACCAA GCCGATTATT
GTGGCCGATG CCGGCATGCT CTCGAAAGAC AACATCCTGA AACTGGAGAA TGAAGGATAC
CGCTACATCG TAGGGGCTCG GATGGCGAGT ACTGCGGTGA GCTTCATTGA TCAGGTTTAC
AAAGCACTAC CTCGTACCGA TAAGGCTCTA CACCGCTTCA GCTACAAGTC TGCCGTAAAG
AATGCCACCA TGATCTGTGA GTTCTCGGAG TCCCGATACA AAAAAGACAA GCGAGAGTTC
GATAAGCAGG TCAAGCGGGC ACTTACCTTG CTTGAAAAAG ATGAACCCGG CCGACGGGCA
AAATTCGTCA AGAAAACCAA AACAACCGAC AAGCCCTATA TCTTCGATAC CAATCTTCAG
GCAAAGGCAG AGAAGCTTCT TGGCATCAAA GGCTATGTGA CCAATATCCC TGAGAAGGAA
CTGTCCAGTC GTGCGGTTAT CGACTACTAT CATGATCTCT GGAACGTAGA GCAGGCTTTT
CGCATGAGCA AATCCGACCT ACAGGCCAGA CCGATCTTTC ATCACACGGA AGATGCCATT
AGGGCTCATA TGCTCGTCTG CTTTATGGCC TTGATGATGG GCAAACTTCT CGAAATAAAA
ATGGGTCGAT CGTTACGCCA GATTCGGGAA AAAATCTGGG CGGTTCATGA AATCCATCTT
TGCTATGAGC GAACCGGTGA GGTTTGTGTC ATGCAGATGG GCACAAGCGA ATTTACCAAC
AAGATTCAAC GATTTCTTGA GCTCTGA
 
Protein sequence
MRGDLKIRKV RTASGATAVQ VVQNKGTGRS FLKHIGSAHD EHELELLLDE ARKFVEAHCR 
QPSLFSDTDT PSTPSLFKTV LDKSSAVGVT HQFARNALLA CARKCGLGSL PELYLDLALM
RIIEPTSKLR SLDLLESYFH VSYAKRTLYR LFPKLLGYQE EIETAAIQTA RRELQEQFSL
VLYDVTTLYF ESFKEYDFQR PGFSKDNKPQ QPQIVIGLIT TRSDFPVMHE AFEGNTFEGK
TMLKVIHRFQ ERVGETKPII VADAGMLSKD NILKLENEGY RYIVGARMAS TAVSFIDQVY
KALPRTDKAL HRFSYKSAVK NATMICEFSE SRYKKDKREF DKQVKRALTL LEKDEPGRRA
KFVKKTKTTD KPYIFDTNLQ AKAEKLLGIK GYVTNIPEKE LSSRAVIDYY HDLWNVEQAF
RMSKSDLQAR PIFHHTEDAI RAHMLVCFMA LMMGKLLEIK MGRSLRQIRE KIWAVHEIHL
CYERTGEVCV MQMGTSEFTN KIQRFLEL