Gene Cphamn1_0829 details

Gene Information

Locus tag: Cphamn1_0829
Symbol:
ID: 6374496
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 892653
End bp: 893711
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 47%
IMG OID: 642683337
Product: transposase IS4 family protein
Protein accession: YP_001959261
Protein GI: 189499791
COG category: [L] Replication, recombination and repair
COG ID: [COG3039] Transposase and inactivated derivatives, IS5 family
TIGRFAM ID:
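
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which the record does not state explicitly but which agrees with the listed gene length. The variable names are illustrative only.

```python
# Values copied from the record; the 1-based inclusive convention is an assumption.
start_bp = 892653
end_bp = 893711

gene_length = end_bp - start_bp + 1   # inclusive coordinates
codon_count = gene_length // 3        # includes the terminal stop codon
protein_length = codon_count - 1      # the stop codon is not translated

print(gene_length, protein_length)
```

Under that assumption the computed values agree with the Gene Length (1059 bp) and Protein Length (352 aa) fields.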


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAACA TCAATCCTCT TGGCCTTTTC GACGAACATT TTCTGCTGGA ACGGCTCACC 
AAGCTCAAAG ATCCATTGGT AAAACTGGAT ACATATATCG ACTGGAACAT CTTTGCGCCT
ATCCTGAATG TTGTCTTCAG TAAGCCTGAA AACAGTAGCA AAGCAGGTCG CCCTCCGTTT
GATAGAGTCA TGATGTTCAA ACTGCTCATT CTACAAAGCT TGTATAGTCT CTCCGATGAT
CAGATGGAGT TCCAAATAAC AGACAGGCTG AGCTTCAAGC GATTTCTGAA GCTGAAGACC
ACCGACAAGG TTCCCGACAG CAAGACCATC TGGAAGTTCC GTGAAACCCT CATCCAAGAA
GGGGTTATCG AAGCTCTGTT TCACCGGTTC AATGAGGCCC TTGACGACCA GTCCGTCTTT
GCAAATACCG GCCAGATTGT CGATGCCAGT TTTGTTGAAG TGCCCCGTCA GCGCAACACA
CGGGACGAGA ACCAGCAGAT CAAGAAAGGC GAAACCCCTG AAGCTTGGAA AGCAAGACCC
AACAAACTTC GTCAAAAAAA TCGTGACGCC CGCTGGACCA AGAAAAATAA GATGTCTTTC
TATGGCTACA AGAACCATAT AAAAGCCGAC AAGGGAACAA AGCTCATCAG CGACTACATG
GTTACCGATG CTTCAGTTCA TGATTCACAG GAGCTTGAAA CCCTTATCAG TACCGACGAT
GGCGGTCAGA AGCTGTACGC AGACGCAGCC TATATTGGAC AGGAAGAAAC TATCGAAAGC
AGTGGTATGA GGAATATGGT TCATGAAAAA GGCAACAGGT ACCATAAACT CACCGATGCC
CAGAAGGCTT CGAACAAAGA AAAGTCTCGT ACCCGCGCCA GAGTTGAACA TGTGTTCGGC
TTCATGACCA ATTCTATGAA CGCCATGTCC ATCAGAACCA TTGGCTACAT ACGGGCAACA
GGCAAGATTG GATTAGCCAA CTTGACCTAT AACATGATGC GCTGCACACA GTTGAAGAAG
AAAGTGCACA ATGTTTTCCT GCGGGATAGC TACGCCTAA
 
Protein sequence
MKNINPLGLF DEHFLLERLT KLKDPLVKLD TYIDWNIFAP ILNVVFSKPE NSSKAGRPPF 
DRVMMFKLLI LQSLYSLSDD QMEFQITDRL SFKRFLKLKT TDKVPDSKTI WKFRETLIQE
GVIEALFHRF NEALDDQSVF ANTGQIVDAS FVEVPRQRNT RDENQQIKKG ETPEAWKARP
NKLRQKNRDA RWTKKNKMSF YGYKNHIKAD KGTKLISDYM VTDASVHDSQ ELETLISTDD
GGQKLYADAA YIGQEETIES SGMRNMVHEK GNRYHKLTDA QKASNKEKSR TRARVEHVFG
FMTNSMNAMS IRTIGYIRAT GKIGLANLTY NMMRCTQLKK KVHNVFLRDS YA
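
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own method: it uses the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs in the set of permitted start codons), and the helper name `translate` is hypothetical. The input is the first line of the gene sequence as shown above, with the spacing left in.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks used for display
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAAAAACA TCAATCCTCT TGGCCTTTTC GACGAACATT TTCTGCTGGA ACGGCTCACC"
last_line = "AAAGTGCACA ATGTTTTCCT GCGGGATAGC TACGCCTAA"

print(translate(first_line))  # MKNINPLGLFDEHFLLERLT
print(translate(last_line))   # KVHNVFLRDSYA
```

Both fragments start on a codon boundary because every full sequence line holds 60 bases, so the outputs line up with the beginning and end of the listed protein sequence.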