Gene Haur_2436 details

Gene Information

Locus tag: Haur_2436
Symbol:
ID: 5734317
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3120861
End bp: 3122681
Gene Length: 1821 bp
Protein Length: 606 aa
Translation table: 11
GC content: 60%
IMG OID: 641279577
Product: hypothetical protein
Protein accession: YP_001545204
Protein GI: 159898957
COG category: [K] Transcription
COG ID: [COG2378] Predicted transcriptional regulator
TIGRFAM ID:
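
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the start and end positions are 1-based and inclusive, which the record itself does not state.

```python
# Values copied from the Gene Information fields above.
start_bp = 3120861
end_bp = 3122681

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# One codon per residue; the final codon is a stop and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under that assumption the result agrees with the listed Gene Length (1821 bp) and Protein Length (606 aa).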


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCTATT CTAGCAGCGC TGTCAAGCTT CATGCCACGA ATGCCCTGCA TGCAGTGGCC 
GTTCGGACGT TGCGCACTCT CGCTCATTAT ACCGATTGTT CGCGCCAACG TAGCCAAACT
GGCCATGCTT TAGCCGCTGC GCTCCAGCGC CACTGGCAAA CACCGCACTA TCGCCAGCTG
GTGCGCCGTT CCTTAACTGC CGCCGACCGA GCATTGTTGC AGGCATGGTG GCAGGGTCAG
CAGCCGTTGC CAACGCCACA AGCGCTTGAT CTCTGGCGCT GGCAGGCTCC TTGGCCTACT
CTGGAGCAGC TCTCGTCGGA GCAACGCTTG GCTGCCTTAG GCTTGGTGGT GCCAATCCGC
ACGACCACAG GCCGCACGGT GGTCTTAATT AATGATACCA GCCGTTGGTT ACGCCGCACT
CCGCCACTGC CACCAACTCC TGTTGCTGCC AGCTTGCAAG CCTTGTTTCA AGCGGTGGTC
GCGTTGCTTG CCGCCTGTGC CAATACCCCT CAACCCCGCC AAGCAGCTGG CTTGGCGCTG
CATATCGCGC AATCAGCCGG CTGGCTGGCC GATCGGCTTA ATCAATGGCG CATTACGCCG
CGTGGTCGGG TTTGGCTGCA TAGCCCAATC GCTGAGCAAC AACGCTTGTT ACACCAACAG
CTCATCACCT GTAACCCGCC TGCACGTGGC TTGGTCGCAT GGCGTAGCCC CGATTGGGCG
GCATTATTTG CCGATTTGGA ACGGTTGATG GAGGCCCAAG CCCAGCGGCG CAGCATGGAT
GTGGCTGCCT TGCTCCACGA TCATCCAGCG TGGAATGGAT TGCCAGCAGC CCAGCAGATT
CGGCTCGTGC ATGGTTGGTT GTGCACCGTC TTGCAACCAG CGGGCGTGGT GAGCTTAGCC
AAGGGCTGGC TCTTTTGGCA TGGCTGGCAG CAGCTCGCAG CCCAAGCGCC AGCCTTCGAT
GGCCTGCGCT TGCCCAAACG TGCGGCGCTC CCCGCAGCCT TACAGGTGTG GGGATTAACT
TGGGGGATGG CAACGAGCCA TGGGTGGCGC ATAACGCACG CATCGGTTAC CGCTCGCTTG
CAACAGGGGC TTGATCTCAA TGGTTTTTGG CAGCCGATTG ATCAGTGGTA TGCTGAACGG
CCCGCCCTTA TTCAGGCCTT GATCGCAAAA CTTCAGGCCA CGCCGCCATT GCGCCTGCGT
CGCATCACAC TGCTTGAGGG TAGCCCCGAA GCCGTGGCAA GCGCCCACGC CAATTGGCAG
ATTCAAGCCT ACCTACAACC TGGGTTTGAT CAAGCCCAAC GGGTGGTGTG CCAAGGGGCG
GAGCAGGTGG TAGCCAAGGT GTTGGGACTA CATGCCACGC CTACGCCAAG CCTCGATACG
CAGACGAGCA TACAGATAAT GGCCTTGCGG ATTGCAGCTC AGCACCTGCC CAGCCATCGG
CTTGCCTTTA ATCAGCAAGC CCAGCATCTG CTGGCCGAGC TGTCGTTTGA GCAACGGTGC
ATCATCGACG ACGATTGGGA ACGTCTCCAA TTAAGTGATG CGCCAGACCT ACTAGCGAGC
AGTCAAGCGC TTGCCGTTGG GCAACAACCA CGAGCGCAGA TCACGGTTGA ACAGGCTCGC
CAAACATGTC GCCAAGCGAT CAACAACCAG CAAAGCGTGA CCGTGCGCTA TTACACGCCA
GCCGAGCATC GCATCACGAC GCGCACGATT CGCCCGCTCG AGCTGACCAG CACCGGGATG
CGCGGTTGGT GTGAATTACG GCAACAGGAG CGGGCTTTTC GCTTTGACCG AATCTTGGCG
ATTGAAGCCA ATACCAGTTA A
 
Protein sequence
MAYSSSAVKL HATNALHAVA VRTLRTLAHY TDCSRQRSQT GHALAAALQR HWQTPHYRQL 
VRRSLTAADR ALLQAWWQGQ QPLPTPQALD LWRWQAPWPT LEQLSSEQRL AALGLVVPIR
TTTGRTVVLI NDTSRWLRRT PPLPPTPVAA SLQALFQAVV ALLAACANTP QPRQAAGLAL
HIAQSAGWLA DRLNQWRITP RGRVWLHSPI AEQQRLLHQQ LITCNPPARG LVAWRSPDWA
ALFADLERLM EAQAQRRSMD VAALLHDHPA WNGLPAAQQI RLVHGWLCTV LQPAGVVSLA
KGWLFWHGWQ QLAAQAPAFD GLRLPKRAAL PAALQVWGLT WGMATSHGWR ITHASVTARL
QQGLDLNGFW QPIDQWYAER PALIQALIAK LQATPPLRLR RITLLEGSPE AVASAHANWQ
IQAYLQPGFD QAQRVVCQGA EQVVAKVLGL HATPTPSLDT QTSIQIMALR IAAQHLPSHR
LAFNQQAQHL LAELSFEQRC IIDDDWERLQ LSDAPDLLAS SQALAVGQQP RAQITVEQAR
QTCRQAINNQ QSVTVRYYTP AEHRITTRTI RPLELTSTGM RGWCELRQQE RAFRFDRILA
IEANTS
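
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the method used to generate this record: the function names are hypothetical, and it relies on NCBI translation table 11 sharing the amino acid assignments of the standard genetic code (the two differ in which start codons are permitted). Sequences are expected in the 10-base, space-separated layout shown above.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks and the final line of the gene sequence above.
head = "ATGGCCTATT CTAGCAGCGC"
tail = "ATTGAAGCCA ATACCAGTTA A"
print(translate(head), translate(tail))
```

The two fragments translate to the first six and last six residues of the protein sequence above. Passing the full gene sequence to `gc_percent` gives the figure to compare against the GC content field. An ambiguous base such as `N` would raise a `KeyError` in this sketch.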