Gene Haur_5222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5222 
Symbol 
ID5737180 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp322626 
End bp323864 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content55% 
IMG OID641282386 
Producttransposase Tn3 family protein 
Protein accessionYP_001547977 
Protein GI159901731 
COG category[L] Replication, recombination and repair 
COG ID[COG4644] Transposase and inactivated derivatives, TnpA family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATCAGTC TGCTCGATAT GCTGAAAGAA ACCGATCTTC GCGTGGGAGT CACGACCGAC 
TTTCACACGA GTACGGCACG GGAACACCTT GATCGTGCCA CCCTCCAACG CCGCCTCTTA
CTCTGCTGCT ATGGCTTGGG AACCAATATT GGCCTCAAAC GGGTCTGCGC AGGGTCGCCC
GGTGACCAGC ATAAAGACCT CGCCTATGTG CGGCGGCGCT TTTTGCTGCG GGATCAGCTG
CGGAATGGCA TTGCCAAAGT CGTCAATGCG CTCTTTGACG CGCGGTTACC GCAGATTTGG
GGTGAGGGAA CCACCGCGTG TGCCTCGGAC TCCAAACTGT TTGGGGCATG GGATCAAAAT
CTGATGACCG ATTGGCATCC CCGGCATCGC GCCGCTGGGG TCAAAATTTA CTGGCACGTG
GATAAAAAAG CCGCCTGCAT TTACTCGCAG CTCAAACACC CCTTTGCCTC CGAGGTTGCC
GCGATGATGG AAGGTCTGCT CCATCACAAT ACCACCATGA CGGTCGAACG CAACTATGTC
GATACCCACG GCCAAAGCGA GATCGCCTTT GGCTTTTGTC ATGTTTTAGG CTTCACATTA
ATGCCACGAT TCAAGGCCAT CCATCGGCAA AAACTCTATC GTCCTGAGCG CGGCAATCGG
ACGGCCTACC CCAATCTCCA GCCGGTCTTG CAACGACCGA TTAATTGGGA GCGTATTCGG
CGTGAATACG ACCAAATCAT CAAATATGCG ACGGCCCTCC GCCTGCGAAC TGCCGAAACC
GATGCGATTC TGCGCCGATT TAGCCGACGC AATTTTCAGC ACCCAACCTT CAAGGCACTC
CTTGAATTGG GGCGGGCCAT CAAGACCATC TTTCTGTGCC AATACCTGCA TTCGGAGGAT
ATGCGTCGGG AGATACACGA GGGCTTACAG GTGGTGGAAA ACTGGAATGG CACGAACGAC
TTCATCTTCT ACGGCAAGGG GCGTGCGTTC AATACCAACC AGCGAGCCGA TATGGAGGTG
TCCATGTTGT GCCTGCATTT GCTCCAAGTC TCCATGGTGT ACATCAATAC CTTGCTCATT
CAGGAGGTAT TGCGGGAGCC AGCGTGGGCG AATCGGTTAA CGCCCGATGA TCTGCGGGCA
CTTACGCCGC TGATCTACAG TCATGTCAAT CCCTTTGGTG TGTTTCTGCT CGACCTCTCG
CAGCGATTGC CGCTCAAACC GATGCGATTG GCGGCCTGA
 
Protein sequence
MISLLDMLKE TDLRVGVTTD FHTSTAREHL DRATLQRRLL LCCYGLGTNI GLKRVCAGSP 
GDQHKDLAYV RRRFLLRDQL RNGIAKVVNA LFDARLPQIW GEGTTACASD SKLFGAWDQN
LMTDWHPRHR AAGVKIYWHV DKKAACIYSQ LKHPFASEVA AMMEGLLHHN TTMTVERNYV
DTHGQSEIAF GFCHVLGFTL MPRFKAIHRQ KLYRPERGNR TAYPNLQPVL QRPINWERIR
REYDQIIKYA TALRLRTAET DAILRRFSRR NFQHPTFKAL LELGRAIKTI FLCQYLHSED
MRREIHEGLQ VVENWNGTND FIFYGKGRAF NTNQRADMEV SMLCLHLLQV SMVYINTLLI
QEVLREPAWA NRLTPDDLRA LTPLIYSHVN PFGVFLLDLS QRLPLKPMRL AA