Gene Haur_3532 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3532 
Symbol 
ID5735391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4446258 
End bp4447502 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content67% 
IMG OID641280679 
Producttransposase IS4 family protein 
Protein accessionYP_001546296 
Protein GI159900049 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000268669 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTTCAC ACAACGACCT CGGGTTGGTA TACTGGGTGG TGACGAAACC AAACCCATCC 
CAAAACCGAG GTCGTATGCC GAGTATACCA GCCCTTGCCC ACTGGATCAA CACCATCCTG
CGCACCGCCG TGCCCACCCT CTCGCCCTGG ACTGCCCGTC GGCTCACCGA TTGGCTCGTC
AGCATCCTGC TCATGCCGTC CATCACCACG CGCGTCGTGG CCTGGGGCTG TGCCCTTGGA
CTGTCCACCG CTGCCCACGC CGCCAGTCAC GAACGCCGAC TGCGCCGCAC CTATCGGGAT
TCCCAGCTGT CGTGGTCGCT CCATCGCGCC ATCCTCGCCA CCACCCTCCA CATCGCACCC
ACTGAATCCG TCACCGTCAT CATCGATGAA ACCACCCACA CCGACCGCTG GACCCTGCTC
ACCGCCGCCC TCTGGTATCA CGGTCGCGCC ATTCCGCTTG CCTGGGTGCT CCATCCCGGC
TATACCCGCC GCGCCACCGC CTTCTGGACC GATGTTGCCA CCCTGCTGGA GCGCGTGCAG
CAGGTGCTGC CCAATGCCAT GTCCGTCGTC GTCGTGGCCG ACCGTGCTTT TGGCTGCCCC
GCCTTCACCG ATCAGGTCGC GGCCTACGGC TGGGGCTGGG TCGTGCGCGT CCAAGGCCAT
ACCCGCATCC AACTGCGGGG GCACACCGAA ACCATGATCC GCACGCTGGT CACGCGAGGC
CATCGCGTGG TGCGGCGGGG CCATGCCTTC AAGAAGGCGG GCTGGCGAAC GGTGACAGTG
GTGGCCGCAT GGGAGGCGAC GTGTCACGAG CCGTTACTGC TGGTGAGCAA TCTGGAGGGC
ATTGGGGCGA TTCGGCAGGC GTATGGGCGG CGCTCTGCGA TTGAGGCCCT GTTTCGCGAT
TGGAAAACGG CGGGCTGGCA ATGGGAGGCG AGCCAGTCGC GGAGCCAGAC GACGCAGGAG
GCCTTGGTGC TGGGCATGGC GATCGCGACG GTGCTGGTGC TGCTGGTCGG GACGGCGGAG
GCGCAGGCGG TGCTGGCCGA ACGCGGGGAT CGCCCCAGCC CGCGCCGCCC ATGGGCGGCA
CGAGAAAGTC TGTTTCGGTT GGGGCGGTAT GGGGTGCTGC GCTGGCTGTG GACGGGAACG
CAGCCAGCGC TGGGAGCGCG ACTATCGTTG GCGGGAACGG CGCTGCACGA ACGGTGGGCC
ACGACGGTGA CGCGGGGTGG TCGGCTCGGG ACGGCCATCC CCTAA
 
Protein sequence
MGSHNDLGLV YWVVTKPNPS QNRGRMPSIP ALAHWINTIL RTAVPTLSPW TARRLTDWLV 
SILLMPSITT RVVAWGCALG LSTAAHAASH ERRLRRTYRD SQLSWSLHRA ILATTLHIAP
TESVTVIIDE TTHTDRWTLL TAALWYHGRA IPLAWVLHPG YTRRATAFWT DVATLLERVQ
QVLPNAMSVV VVADRAFGCP AFTDQVAAYG WGWVVRVQGH TRIQLRGHTE TMIRTLVTRG
HRVVRRGHAF KKAGWRTVTV VAAWEATCHE PLLLVSNLEG IGAIRQAYGR RSAIEALFRD
WKTAGWQWEA SQSRSQTTQE ALVLGMAIAT VLVLLVGTAE AQAVLAERGD RPSPRRPWAA
RESLFRLGRY GVLRWLWTGT QPALGARLSL AGTALHERWA TTVTRGGRLG TAIP