Gene Haur_2039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2039 
Symbol 
ID5733928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2542293 
End bp2543429 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content56% 
IMG OID641279183 
Producttransposase IS4 family protein 
Protein accessionYP_001544810 
Protein GI159898563 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTCAAA GAAAGGTGAT CATCATGGGG AGCAGTCATG AATTATACAC GCGGGTTTGG 
ACGACGTTGC ACCAGTTTCA TCCAACCCTC CATGCACGGC GACTGGCGAC CTGGGCTTGG
GTCATTGTCG GCTTACTCCA TGCGCGATCC GTCCATCTTA GCGCTGTGGC GCTCCATCTG
GCGAGCGATG CTGAGGCCGC TGGGCGAATC GCACGCATTC GGCGCTGGCT CGCCAATCCG
TGGCTTGATA CCCAGTTTCT CTATCGTCCG CTCATTACCC ATGTGCTCAC GGCTTGGCGC
AATCGCGACA TCACTATCAT GATTGACGGG TGCTACGTCA ATCACGACAA ACTCCAGATG
GTTCGCCTGT CCTTATCCCA CTGTTATCGG GCAATCCCTC TCGCGTGGCA GGTCATGAGC
CATCACGGGA ACGTCTCCGT GGAGTCATGT CAGCGGATGC TTAATCGGGT ACAACAACTT
CTGATCGGAA CCCGTCGTGT GACGTTTCTT GCGGATCGGG GCTTTCGCGA TTGGGCATGG
GCTGCAAGCT GCCAGCGCCG CGGCTGGGAT TACATCATTC GGATCGCAAA TACAACGACC
ATTCGCTGGG ATGATGGCCC ATGGATGGCG ATCAACACTA TGGCAGTAAA GCCCGGCAAG
TCCGTCTATC TGCGCAATGT TTTGCTCACC CAAGACGGAG AATGGCGCTG TACTATCGCC
ATTACGTGGA CACGTGCCAC GAAAACCAAG CCTGCGGAAC GATGTGCGGT AATAACCAAC
CGAGAGCCGA GCAAATGGAT TCTGAACCAT TATTTGCGCC GTATGCATAT CGAAGAGAGC
TTCCGCGATG ACAAATCGGG CGGATTTGAT TTGGATGCCA GTCGCCTGCG CGATCCGCAG
CGGCTTGATC GGCTGCTATT GGCGATCGCC GTGGCAACGC TCTGGATGTA TGAACTGGGG
GAACGCGTAC TCAAGGATGA GCAACGTGCC CACGTCGATC CAGGCTATCA GCGTCAACTC
AGTGTGTTTC AGCTAGGATG GCGTTGGCTC CGGCGAGCAT TGAGCCTTGC CGATATCCCG
AAATGGAACC TCACGCTCCA TCCGTTTCAG CCTGAGCGGG TCGCAGCAAA GTGTTAG
 
Protein sequence
MLQRKVIIMG SSHELYTRVW TTLHQFHPTL HARRLATWAW VIVGLLHARS VHLSAVALHL 
ASDAEAAGRI ARIRRWLANP WLDTQFLYRP LITHVLTAWR NRDITIMIDG CYVNHDKLQM
VRLSLSHCYR AIPLAWQVMS HHGNVSVESC QRMLNRVQQL LIGTRRVTFL ADRGFRDWAW
AASCQRRGWD YIIRIANTTT IRWDDGPWMA INTMAVKPGK SVYLRNVLLT QDGEWRCTIA
ITWTRATKTK PAERCAVITN REPSKWILNH YLRRMHIEES FRDDKSGGFD LDASRLRDPQ
RLDRLLLAIA VATLWMYELG ERVLKDEQRA HVDPGYQRQL SVFQLGWRWL RRALSLADIP
KWNLTLHPFQ PERVAAKC