Gene Haur_1938 details

Gene Information

Locus tag: Haur_1938
Symbol:
ID: 5733827
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2350188
End bp: 2351510
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 63%
IMG OID: 641279082
Product: transposase IS4 family protein
Protein accession: YP_001544709
Protein GI: 159898462
COG category:
COG ID:
TIGRFAM ID:
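
The coordinate and length fields above are linked by simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the numbers are consistent with it) and that the CDS ends in a single stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS: codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2350188, 2351510)
print(length_bp, protein_length(length_bp))
```

With the values in this record the two results agree with the Gene Length and Protein Length fields.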


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000606768
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGATCA TACCGCAGAT CAGTCACGCT ATGCACACCC TGTTAACGAC CACGACCGAG 
GCCATTGCCG CTGCTCAGCA GTATGTCAAA CGCCCTGACC GCGCCAAATT CTCTCCCAGT
ACCCTCGTTC AAACCCTCGT CTATGGCTGG CTGGCTCAGC CAACCGCCAC GGTGGAGCAA
TTGGCCCAAA TGGCCTGCCG CATCGGCGTT GCTGTCTCTC CCCAAGCGAT TGATCAACGC
TTTACCATGG CCACCGCTGA CCTGCTCCAC CAGCTCGTCA TCGCCAGCAT CCACCCCGTC
ATCGCCGCCA ATCCCGTGAC CCTGCCCATC CTCCAACGCT TTGCCAGCGT GCGCGTTCAT
GACAGCACCT CCATTGGCTT GCCCGATGCC CTGACCGGCA TCTGGCGCGG CTGTGGCAAT
GCGACAACGG GCGGCGGAGC CACCCTGAAA TGTGGTGTCC AGCTTGATGT GCTCACGGGC
GCGATCACCG CCCTCGATCT GGTCAACGGA CGCGCAGCGG ATCGAGCGCT CCCGCTTCAG
CAGCGCGATC TGCCGCCGGG GAGTTTACTG CTCGCCGATC GCGGGTTTTA CCACTTGGAG
CGGTTGCGCC AGCACGATCA GCAGGGGGTT TTGTGGATCA CCCGCCTGCC CAGCAACGCC
GTCGTGGCCT ATCCGGGACA CGCCGCGCAG CCGTTGGCCA CGTTTGTCCG CGAGCTTGGC
CCGGTGGCAA CGTGGGATTG TGCGATCATC GTGGGGAAGG AGCAGCAGGT GCATGGGCGG
CTGATCGTCA CGCGGGTGAC GCAGGCGGTT GCCGATCAGC GCCGGGCACG GATTCGCCAG
CATGCCCAGC ACCAGCATCG GATGCCGTCG GCAGCGGCCT TGGCGCTGGC GGATTGGAAT
GTGGTCTTCA CGAATGTGCC ACGGCTGCTG ATCAGCACGA CCGAGGTCTG GACCGTGATG
CGGGTGCGCT GGCAAATTGA ACTGCTGTTT AAGCTCTGGA AAAGCCATGC ACGAATCGAT
GACTGGCGCA CGGCGAATCC GGCACGGGTC TTGTGTGAAA TCTATGCCAA ATTGATTGGG
CTGGTGTTTC AGCAGTGGCT GCTGGCCGCC AGTAGCTGGC ATGATCCGGA GCGGAGTTTG
TTCAAAGCCG CGCCGATTGT CGCGGGGATG GCGGGCGAAC TGGCCAGCAC GCAGGCTGAT
CCGCCGCAGT TTTTGCGGGT GTTGACACGG CTCGCGGCCT TGATTCAGCG CTGGGCGAGG
ACGAACAAAC GCCAGCAGCC ACCGACCACG GCTCAGCGTT TACGGGCATT AACGGCAGCA
TGA
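
The GC content field in the Gene Information section can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted exactly as displayed here, with spaces and line breaks separating 10-base groups. The helper names `clean_sequence` and `gc_content` are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of bases that are G or C, between 0.0 and 1.0."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage with the first displayed line of the gene sequence only;
# paste the full block to recompute the value for the whole gene.
first_line = "ATGGAGATCA TACCGCAGAT CAGTCACGCT ATGCACACCC TGTTAACGAC CACGACCGAG"
print(len(clean_sequence(first_line)))
print(f"{100 * gc_content(first_line):.1f}% GC")
```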
 
Protein sequence
MEIIPQISHA MHTLLTTTTE AIAAAQQYVK RPDRAKFSPS TLVQTLVYGW LAQPTATVEQ 
LAQMACRIGV AVSPQAIDQR FTMATADLLH QLVIASIHPV IAANPVTLPI LQRFASVRVH
DSTSIGLPDA LTGIWRGCGN ATTGGGATLK CGVQLDVLTG AITALDLVNG RAADRALPLQ
QRDLPPGSLL LADRGFYHLE RLRQHDQQGV LWITRLPSNA VVAYPGHAAQ PLATFVRELG
PVATWDCAII VGKEQQVHGR LIVTRVTQAV ADQRRARIRQ HAQHQHRMPS AAALALADWN
VVFTNVPRLL ISTTEVWTVM RVRWQIELLF KLWKSHARID DWRTANPARV LCEIYAKLIG
LVFQQWLLAA SSWHDPERSL FKAAPIVAGM AGELASTQAD PPQFLRVLTR LAALIQRWAR
TNKRQQPPTT AQRLRALTAA