Gene Haur_4737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4737 
Symbol 
ID5736581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6047072 
End bp6048187 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content52% 
IMG OID641281902 
ProductIS605 family transposase OrfB 
Protein accessionYP_001547496 
Protein GI159901249 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAACGC TGACCCGTTG CTATAAATAT CGCCTGTATC CTGCCACTGA CCAACAAAAC 
ACCTTGGTAC AGTGGGCGGG TTGCCGACGA TTTGTCTGGA ATTGGGCATT GCACTGCAAG
CAAACCCACT ACCAAGCAAC GGGTCAACGG CTGAGCTATC AACGGCTTGC GGCGATGTTG
GTTGATCTGA AACGTCAGCC CAAAACGGCA TTTTTGCGTG ATTGCCATTC GCAACCCTTG
CAACAAGCGC TGATGGATTT AGAAACGGCC TTTACTCACT TTTTTGCCAA ACGGGCGAAG
TATCCCCGTT TCAAAGCACG CAAAGTCACA CCGCATAGCC TCCGCTTCCC GCAAGGCGTG
GTCGTCGTTG ATGAACACAC CATCAGCGTG CCAAAAATCG GGCTGATGCA GGCGATCATT
CATCGCCCAC TGCTGGGAAC AGCAAAGGGC GCAACGATCA AACAAGACGC AACGGGTGCA
TGGTGGGTCG TTTTTGTTTG CCACATCAAC CGCCCTGATG TTTTGCTAAC TACTGATAAT
CCTGTGGGCA TTGATGTGGG ACTTGAATCC TTCACTACCC TGTCAACGGG AGAGAAAACT
ACACCGCCCA AATTCTACCG TCGAAGCCAA AAGAAACTTG CCCGTGCTCA GCGGAAACTC
TCACGCGCCC AAAAGGGCAG CAACAACCGC TTGAAAGCTC GTAAGCACGT TGCCAAAATT
CACCAGAAAA TTAGCAACCA ACGCGCCGAT TGGCTGCATA AGCATGCGTT GGGGATCGTT
CGCCAATTTG ATGTGGTGTG CATTGAAGAC CTGAATCTCA AAGGCCTTGC GAAAACCAAG
CTGGCCAAAT CATTCAGTGA TGCCGCCCTG AGTACCTTCA TGCAGATGTT ACACGATAAA
GCGGAATGGC ACGGACGGCG AGTGATTAAG GTTGGGCGGT TCTACGCCTC ATCAAAAACC
TGCCATCACT GCCAAACGAA AACCGCCTTG ATGCTATCAG ATCGCGTGTG GACATGCCCC
ACCTGTGGCA CGATCCATGA TCGCGATAGG AATGCGGCGA TCAACATCGT GCACGAAGGA
ATACGCCTGC TTGCCGTTGG GACGACGGAA AGCTAA
 
Protein sequence
MRTLTRCYKY RLYPATDQQN TLVQWAGCRR FVWNWALHCK QTHYQATGQR LSYQRLAAML 
VDLKRQPKTA FLRDCHSQPL QQALMDLETA FTHFFAKRAK YPRFKARKVT PHSLRFPQGV
VVVDEHTISV PKIGLMQAII HRPLLGTAKG ATIKQDATGA WWVVFVCHIN RPDVLLTTDN
PVGIDVGLES FTTLSTGEKT TPPKFYRRSQ KKLARAQRKL SRAQKGSNNR LKARKHVAKI
HQKISNQRAD WLHKHALGIV RQFDVVCIED LNLKGLAKTK LAKSFSDAAL STFMQMLHDK
AEWHGRRVIK VGRFYASSKT CHHCQTKTAL MLSDRVWTCP TCGTIHDRDR NAAINIVHEG
IRLLAVGTTE S