Gene Haur_2046 details

Gene Information

Locus tag: Haur_2046
Symbol:
ID: 5733935
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2557730
End bp: 2558875
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 59%
IMG OID: 641279190
Product: putative transposase
Protein accession: YP_001544817
Protein GI: 159898570
COG category:
COG ID:
TIGRFAM ID:
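
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon. The helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2557730
END_BP = 2558875
GENE_LENGTH_BP = 1146
PROTEIN_LENGTH_AA = 381


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumed to be included).
    return gene_length // 3 - 1


print(span_length(START_BP, END_BP))            # 1146
print(expected_protein_length(GENE_LENGTH_BP))  # 381
```

Under these assumptions both derived values agree with the reported gene length (1146 bp) and protein length (381 aa).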


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.367263
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCACGC TCAACGCCAT TGTCCAGCGC TATGGTCCGG CCTATTGGGC CACCCCGCCA 
GCGCCGATCA GTCCTGACCA ACGCCGGGTG CTCCATGCCC TGGATGCATG TCGGACGGAG
CACCTAGGCG GTCAGGTGTT CATCTGCCCC CACTGTCACA TAGCCCGCTA CAGCTACCAT
TCATGTCGGA ACCGACATTG TCCCACCTGC CAACACGATG CTGGCCAAAC CTGGTTGGCC
AAGCAACAGG CGCTGCTGCT GCCCGTCCCC TACTTTCTCG TCACGTTTAC CTTGCCGGCG
GAACTCCGGG CGTTTGCGGC GGCCAATCAG CGCCAGGTCT ATGATTGTTT CTTTCGGGCC
TCAGCTGCGG CACTCCAGCA GCTAGCCCAC GATCCTCGGC TGCTCGGCGG GCAATTGGGG
ATGCTGGGCA TCCTGCAAAC CTGGACGCGC GACCTGCGCT ACCATCCGCA TATCCACTAT
CTCATTCCGG CTGTTGTCCG TACCCCCGAT GGCACCATCT GCCAGCCTGC CCCAGGGTTC
CTGCTCCCCG TGCGGCCCTT AGCGCTGCTA TTTCGCGGCA AACTGCGTGC TGCCATCGGC
CAACTTCCGG GCGGCACGAC CCGTGATTCG GCGATCTGGC AGCGCCCATG GGTCGTCGAT
TGTCGCCCAG TTGGCACCGG TGAAACAGCC TTGAAATACT TGGCACCGTA CATCTTCCGG
GTGGCCTTGA GCAACAATCG GCTCCTCAGC ATGGATCATG ATAACGTGAC CTTTCGCTAC
ACCAATGGGC AGACGCACCA GACCTGTACC AAAACCCTCT CGGCACTGAC ATTTCTCGAG
CAATTTCTTC AACACGTCTT GCCAAAAGGC TTTGTCAAAG TGCGTTATTT TGGCTTGTTT
TGTCCCGCCA AGCGGGCGTT CCTCCGCCGC ATCCGCGCCC AATTGATGCT CTCCCGTGGA
CAGGAGTTCA GTCAGCCACC CGTCATTCAT TGCCTGCAGG AAGCACCGCT GTGCCCGCAG
TGCGGTGCTG TGATGCGACG CCAAGAACTT CCGGTGGTTT GTCAAGAAGT GTGCAAAATC
AAATTAGCCA GGCATGCGAA TTTTCCGAAG CTGAAGTGTT CGAACAACGG CATAAAACAG
CAATAA
 
Protein sequence
MITLNAIVQR YGPAYWATPP APISPDQRRV LHALDACRTE HLGGQVFICP HCHIARYSYH 
SCRNRHCPTC QHDAGQTWLA KQQALLLPVP YFLVTFTLPA ELRAFAAANQ RQVYDCFFRA
SAAALQQLAH DPRLLGGQLG MLGILQTWTR DLRYHPHIHY LIPAVVRTPD GTICQPAPGF
LLPVRPLALL FRGKLRAAIG QLPGGTTRDS AIWQRPWVVD CRPVGTGETA LKYLAPYIFR
VALSNNRLLS MDHDNVTFRY TNGQTHQTCT KTLSALTFLE QFLQHVLPKG FVKVRYFGLF
CPAKRAFLRR IRAQLMLSRG QEFSQPPVIH CLQEAPLCPQ CGAVMRRQEL PVVCQEVCKI
KLARHANFPK LKCSNNGIKQ Q
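
The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not part of the original record, showing how the gene sequence could be cleaned, its GC fraction computed, and its translation compared against the protein sequence. The function names `clean`, `gc_fraction`, and `translate` are hypothetical. The codon map is the standard genetic code. Translation table 11 is assumed to share its codon-to-amino-acid assignments and to differ only in which start codons are permitted, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Standard codon assignments, in TCAG order with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First display row of the gene sequence from this record.
first_row = "ATGATCACGC TCAACGCCAT TGTCCAGCGC TATGGTCCGG CCTATTGGGC CACCCCGCCA"
print(translate(first_row))  # MITLNAIVQRYGPAYWATPP
```

The translated first row matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the reported GC content and the complete protein sequence to be checked in the same way.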