Gene Cphamn1_2371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_2371 
Symbol 
ID6376066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp2538345 
End bp2539880 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content53% 
IMG OID642684853 
Producttransposase IS4 family protein 
Protein accessionYP_001960751 
Protein GI189501281 
COG category[L] Replication, recombination and repair 
COG ID[COG5421] Transposase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGGAG ACTTGACAAT CAGAAAGGTG CGAACTGCAT CTGGTGCAAC TGCCGTTCAG 
GTGGTTCAGA ATAAAGGGAA ACAGCGTTCC TTTCTCAAGC ATATCGGCAG TGCTCATAAT
GAGCAGGAAC TCGAACTGTT GATGGCTAAA GCCGAGAAAT ACGTTGAAAC CCATTGCAGA
CAGCCAAGCC TGTTTGCTGA CACTCCACCA TCACCTTCAT CGCTCCAGAC AGCTCTTGCA
GGCTCCAAGC TTGTCGGCGT CACCCATCAG TATGCGCGCA ATACGCTTCA TGCCTGCGCT
CGAAAATGCG GTCTCGGCTT TTTGCCGGAA CTGTATTTGG ATCTGGCGCT CATGCGTATC
ATCGAGCCCG CCTCGAAGCT GCGTACGCTG GAGCTTCTGG AGTGGTATTT CAATGTCCGT
TATGCCAGAA GAACTCTCCA TCGCATGCTC AGCAAGCTGC TTGAGCATCA GGAGGCGATC
GAGATGGCAG CTATCCAGAC TGCCCAGAAC GATCTGCAGG ACAAATTCAT CCTGGTGCTC
TACGATGTCA CTACGCTGTA CTTCGAGTCC TTCAAGGAAT ATGACTTCCA ACGTCCCGGG
TTCTCGAAGG ACAACAAGCC ACAGCAGCCG CAAATCGTCA TCGGCCTGAT CACCACCCGC
TCAGGGTTCC CTGTCATGCA CGAGGTGTTC GAAGGCAACA CCTTTGAGGG GCATACCATG
CTGGCCATCG TGCACCGCTT CCAGGAGCGG GTCGGAAATA CCAAGCCGGT GATCGTTGCC
GATGCCGCTA TGCTCTCGAA AGCCAACATG CAGCAACTGG AGTCTGAAGG ATATCGCTAT
ATCGTGGGAG CCCGGCTGGC GAACACTGCA GCCAACTTCA TCGATCAGAT TCACACGGCA
CTGCCTCGAA CGGACAAGGC TATACGACGC TTCAGCTATG ATGGCGTCGT GAAAAATGCC
ACCATGATCT GTGAGTTCTC TGATGCCCGG TACAAGAAGG ACAAGCGGGA CTTCGACAAG
CAGGTCAAGC GAGCGCTTGT TCTGCTTGAA CGCAATGAGC CCGGCAGACG GGCTAAATTC
ATCAAGAAGT CCAAAGAGAA AGACAAACCC TTCATCTTCG ATGCCGGCCT ACAGGCAAAG
ACAGAGAAGC TTCTCGGTAT CAAAGGCTAT GTCACTAACA TCACAGAAGA CGAGTTGTCC
AGTAGCGAGC TTATCGCATA CTATCGCGAT CTCTGGCACG TCGAACAGGC ATTCCGCATG
AGCAAGTCTG ACCTGCAGGC ACGACCGATC TTCCATCGTA CGCAAGATGC CATCCGCGCC
CACATGCTCA TTTGCTTCAT GGCACTGATG ATGGGAAAAT ACCTCGAGAT AAAAACGGGT
CGCTCGTTAC GACAGATCCG GAAAAAACTC TGGCAGGTGC ATGAAGCCCA TATCCTTGAT
GAGCAAACCG GTGAGGTACA TGTGATGCAG ATGGATGTCG GCGAATTTGC GGGCAGCGAA
CTCAATAAAC TCTTGGATTC TGGGTTTTCG CACTAA
 
Protein sequence
MRGDLTIRKV RTASGATAVQ VVQNKGKQRS FLKHIGSAHN EQELELLMAK AEKYVETHCR 
QPSLFADTPP SPSSLQTALA GSKLVGVTHQ YARNTLHACA RKCGLGFLPE LYLDLALMRI
IEPASKLRTL ELLEWYFNVR YARRTLHRML SKLLEHQEAI EMAAIQTAQN DLQDKFILVL
YDVTTLYFES FKEYDFQRPG FSKDNKPQQP QIVIGLITTR SGFPVMHEVF EGNTFEGHTM
LAIVHRFQER VGNTKPVIVA DAAMLSKANM QQLESEGYRY IVGARLANTA ANFIDQIHTA
LPRTDKAIRR FSYDGVVKNA TMICEFSDAR YKKDKRDFDK QVKRALVLLE RNEPGRRAKF
IKKSKEKDKP FIFDAGLQAK TEKLLGIKGY VTNITEDELS SSELIAYYRD LWHVEQAFRM
SKSDLQARPI FHRTQDAIRA HMLICFMALM MGKYLEIKTG RSLRQIRKKL WQVHEAHILD
EQTGEVHVMQ MDVGEFAGSE LNKLLDSGFS H