Gene Haur_3396 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3396 
Symbol 
ID5735257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4278197 
End bp4279354 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content53% 
IMG OID641280543 
ProductIS605 family transposase OrfB 
Protein accessionYP_001546160 
Protein GI159899913 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000848869 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAATGC TGACCCGTTG TTACAAATAC CGTCTGCAAC TCACACCGAC CCACGTTGAA 
ACCTTGGTAC AGTGGGCGGG TTGTCGGCGC TTCGTCTGGA ATTGGGCGCT GCACTGCAAG
CAAACCCACT ACCAAACAAC GGGTCAACGG CTGAGCTATC AACGGCTTGC GGCGGCATTG
GTTGATCTGA AACGTCAGCC CAAAACGGCA TTTTTGCGTG ATTGCCACTC ACAACCGTTG
CAACAAACCT TGATGGATTT GGAAACGGCC TTCAGCAACT TTTTTGCCAA ACGCGCCAAG
TACCCGCGAT TCAAATCACG CAAAATCACG CCGCACAGCC TACGCTTCCC GCAAGGTGTG
ATCGTGGTTG ATGAACATAC CATCAGCGTG CCAAAAATCG GGCTGATACG GGCGATCATT
CATCGCCCCT TGCAAGGCAT AGCGAAGAGT GCAACGATCA AACAGGATGC CACAGGCGCG
TGGTGGGTCA TTTTCGTCTG TCATATCGAC CTCCCTGATG TTCAACCAAC AGCTGATCGA
CCTGTGGGCA TTGATGTCGG GCTTGAATCC TTCACCACGC TGTCAACGGG CGAGAAGACA
GCACCACCAA AGTTCTACTG TCGAAGCCAA AAGAAACTTG CCCGTGCTCA GCGCAAACTC
TCACGCGCCC AAAAGGGCAG CAACAACCGC TTGAAAGCAA AAAAGCACGT TGCCCGTATC
CACAAGAAAA TCAACAACCA ACGTGCCGAT TGGCTGCATA AGCATGCGTT GGGGATAGTT
CGCCAATTTG ACGTGGTGTG CATCGAAGAC CTGAATATTA AAGGCCTTGC GAGAACCAAG
CTGGCCAAAT CATTCAGTGA TGCCGCACTG AGTACCTTCA TGCAACGATT GCAGGAAAAA
GCTGAATGGC ACGGACGACG AGTTGTTAAG ATTGGGCGGT TCTACGCCTC ATCGAAAACT
TGCCACTTCT GTCATTCCAA GACTGCCTTG ACGCTGGCTG ACCGCGTGTG GACATGCCCC
ACCTGTGGCA CGACCCATGA TCGCGATGGC AACGCCGCGA TCAACATGCT GTATGAAGGG
CTACGCCTGC TTGCCGTTGG GACGACGGAA AGCCAAAACG CTGCTCGAGA TGGTGTAAAC
CCAGCGAAAC GCTGGTAG
 
Protein sequence
MRMLTRCYKY RLQLTPTHVE TLVQWAGCRR FVWNWALHCK QTHYQTTGQR LSYQRLAAAL 
VDLKRQPKTA FLRDCHSQPL QQTLMDLETA FSNFFAKRAK YPRFKSRKIT PHSLRFPQGV
IVVDEHTISV PKIGLIRAII HRPLQGIAKS ATIKQDATGA WWVIFVCHID LPDVQPTADR
PVGIDVGLES FTTLSTGEKT APPKFYCRSQ KKLARAQRKL SRAQKGSNNR LKAKKHVARI
HKKINNQRAD WLHKHALGIV RQFDVVCIED LNIKGLARTK LAKSFSDAAL STFMQRLQEK
AEWHGRRVVK IGRFYASSKT CHFCHSKTAL TLADRVWTCP TCGTTHDRDG NAAINMLYEG
LRLLAVGTTE SQNAARDGVN PAKRW