Gene Haur_2388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2388 
Symbol 
ID5734269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3040804 
End bp3042624 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content60% 
IMG OID641279529 
Producthypothetical protein 
Protein accessionYP_001545156 
Protein GI159898909 
COG category[K] Transcription 
COG ID[COG2378] Predicted transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCTATT CTAGCAGCGC TGTCAAGCTT CATGCCACGA ATGCCCTGCA TGCAGTGGCC 
GTTCGGACGT TGCGCACTCT CGCTCATTAT ACCGATTGTT CGCGCCAACG TAGCCAAACT
GGCCATGCTT TAGCCGCTGC GCTCCAGCGC CACTGGCAAA CACCGCACTA TCGCCAGCTG
GTGCGCCGTT CCTTAACTGC CGCCGACCGA GCATTGTTGC AGGCATGGTG GCAGGGTCAG
CAGCCGTTGC CAACGCCACA AGCGCTTGAT CTCTGGCGCT GGCAGGCTCC TTGGCCTACT
CTGGAGCAGC TCTCGTCGGA GCAACGCTTG GCTGCCTTAG GCTTGGTGGT GCCAATCCGC
ACGACCACAG GCCGCACGGT GGTCTTAATT AATGATACCA GCCGTTGGTT ACGCCGCACT
CCGCCACTGC CACCAACTCC TGTTGCTGCC AGCTTGCAAG CCTTGTTTCA AGCGGTGGTC
GCGTTGCTTG CCGCCTGTGC CAATACCCCT CAACCCCGCC AAGCAGCTGG CTTGGCGCTG
CATATCGCGC AATCAGCCGG CTGGCTGGCC GATCGGCTTA ATCAATGGCG CATTACGCCG
CGTGGTCGGG TTTGGCTGCA TAGCCCAATC GCTGAGCAAC AACGCTTGTT ACACCAACAG
CTCATCACCT GTAACCCGCC TGCACGTGGC TTGGTCGCAT GGCGTAGCCC CGATTGGGCG
GCATTATTTG CCGATTTGGA ACGGTTGATG GAGGCCCAAG CCCAGCGGCG CAGCATGGAT
GTGGCTGCCT TGCTCCACGA TCATCCAGCG TGGAATGGAT TGCCAGCAGC CCAGCAGATT
CGGCTCGTGC ATGGTTGGTT GCGCACCGTC TTGCAACCAG CGGGCGTGGT GAGCTTAGCC
AAGGGCTGGC TCTTTTGGCA TGGCTGGCAG CAGCTCGCAG CTCAAGCGCC AGCCTTCGAT
GGCCTGCGCT TGCCTAAACG TGCGGCGCTC CCCGCAGCCT TACAGGTGTG GGGATTAACT
TGGGGGATGG CAACGAGCCA TGGGTGGCGC ATTACCCAAG CATCCGTCGC CGCTGCGCTG
GCTAACGGAC TTGATCTCAG TAGTTTTTGG CAGCCGATTG ATCAGTGGTA TGCTGAACGG
CCCGCCCTTA TTCAGGCCTT GATCGCAAAA CTTCAGGCCA CGCCGCCATT GCGCCTGCGT
CGCATCACGC TGCTTGAGGG TAGCCCCGAA GCCGTGGCAA GCGCCCACGC CAATTGGCAG
ATTCAAGCCT ACCTACAACC TGGGTTTGAT CAAGCCCAAC GGGTGGTGTG CCAAGGAGCG
GAGCAGGTGG TAGCCAAGGT GTTGGGACTG CATGCCACGC CTACGCCACG GCTCGATACG
CAGACGAGCA TACAGATAAT GGCCTTGCGG ATTGCAACTC AGCACCTGCC CAGCCATCGG
CTTGCCTTCA ATCAGCAAGC CCAGCGGCTG TTGGCCGAGC TGTCGTTTGA GCAACGGTGC
ATCATCGACG ACGATTGGGA ACGTCTCCAA TTAAGTGATG CGCCGCAACC ACTGGCTACG
AGCCAGTCGC TCGCGGTTGG GCAGCAACCA CGAGCGCAGA TCACGGTCGA ACAGGCTCGC
CAAACATGTC GCCAAGCGAT CAACAACCAG CAAAGCGTGA CCGTGCGCTA TTACACGCCA
GCCGAGCATC GCATCACGAC GCGCACGATT CGCCCGCTCG AGCTGACCAG CACCGGGATG
CGCGGCTGGT GTGAATTACG CCAACAGGAG CGGGCTTTTC GCTTTGATCG GATTCTGGCA
ATCGACCTGA ATTCGAGCTA A
 
Protein sequence
MAYSSSAVKL HATNALHAVA VRTLRTLAHY TDCSRQRSQT GHALAAALQR HWQTPHYRQL 
VRRSLTAADR ALLQAWWQGQ QPLPTPQALD LWRWQAPWPT LEQLSSEQRL AALGLVVPIR
TTTGRTVVLI NDTSRWLRRT PPLPPTPVAA SLQALFQAVV ALLAACANTP QPRQAAGLAL
HIAQSAGWLA DRLNQWRITP RGRVWLHSPI AEQQRLLHQQ LITCNPPARG LVAWRSPDWA
ALFADLERLM EAQAQRRSMD VAALLHDHPA WNGLPAAQQI RLVHGWLRTV LQPAGVVSLA
KGWLFWHGWQ QLAAQAPAFD GLRLPKRAAL PAALQVWGLT WGMATSHGWR ITQASVAAAL
ANGLDLSSFW QPIDQWYAER PALIQALIAK LQATPPLRLR RITLLEGSPE AVASAHANWQ
IQAYLQPGFD QAQRVVCQGA EQVVAKVLGL HATPTPRLDT QTSIQIMALR IATQHLPSHR
LAFNQQAQRL LAELSFEQRC IIDDDWERLQ LSDAPQPLAT SQSLAVGQQP RAQITVEQAR
QTCRQAINNQ QSVTVRYYTP AEHRITTRTI RPLELTSTGM RGWCELRQQE RAFRFDRILA
IDLNSS