Gene Haur_4830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4830 
Symbol 
ID5736675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6158223 
End bp6159584 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content51% 
IMG OID641281995 
Producthypothetical protein 
Protein accessionYP_001547588 
Protein GI159901341 
COG category[S] Function unknown 
COG ID[COG2308] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAGAGC AAGCGATTCG TGATTATCAC GCCCTCCTCA CGCCCGATCT CGCTCGTGCT 
TCTCAAGAAC GTCTCACCAA CCTACAACAG CAACAAAATC TCTTTTTTGG CACCCGTCCG
TTGTGCAATG TGCTACGCCC ACATTTTCTC AGCGTCGAGC AATCGAGCTT GATCGAGCGC
GTCTCGCAGT TGGTGGCCGA AGCATCGCGC ACGGTGGTTG AGTATGCTTT GCGCACGCCT
GAGGTGCTCG ATCTGCTGGC CTTGACCGAG GGTGAACATC AATTAATTAG TTACGAGCCT
GGCTACCGTG AATTAAGCGT TTCTTCACGG CTTGATTCAT TTTTAACCAG TGATAGTAGC
TCGTTTCAAT TTGTTGAATA TAACGCCGAA AGCCCCGCCG CAATCGCCTA CGAAGATATT
CTTTCGCAGG TGTTCGAGCA ATTGCCGATT ATGCAAGAGT TTCAGCGCCA TTATCGGGTT
GAAAGTTTGC CTGCTCGGCA ACGCTTGTTA GAAGCTTTTT TGGCGGTTTA TCGCGAATGG
GGTGGCACAG GCGAGCCAAA AATTGCCATT GTCGATTGGC ATGGCCTGCC GACGCTCTCG
GAATTTCAAC TATTTCAGCA ATATTTTGCT GAGCATGGCT TGAAAACGGT GATTTGTGCG
CCGGAAGATT TGCGCTATCA GGCTGGCACG CTGTATGCCA ATAACACACC AGTTAATTTT
GTCTATAAAC GCTTGTTGAC AACCGAATTT TTGCAGCGTT TGGGCAATGA AGCCTTTGAT
CATCCATTAA CTCAGGCCTA CCGTGATGGA GCAATTTGTT TGGCCAATAA TTTTCGCGCC
AAATTGCTGC ACAAAAAAAT GATCTTTGGC TTGTTATCTG ATCCGGCGAT CACTAGCGCC
GCTGGAATTA GCTCAGCCAC CCAGCAACAG CTGGCCCAGC ACATTCCTTG GACGCGGCGA
GTGACGGCTG GCCGCACCGA TTATGCTGGC ACAGAAGTTG ACCTACTCGA TTTTATTCGG
CAAAACCGTG ATCGACTGTT GCTCAAGCCC AACGACGATT ATGGCGGCCA CGGGATTACG
ATTGGTTGGG AAACCGAGGC CGAAGCTTGG GATTTGGCGC TACAGCAGGC CTTAACTGAG
CCATTTGTGG TGCAAGAGCG CGTAGTGATT GCCTACGAGG ATTATCCAGC TATGGTGGAT
GGTCAATTGC AGATCGGCCA GCGCTTGGTC GATACCGATC CATTTTTATT TGGCAGCGAA
GTTCAAGGCT GTCTGACGCG CTTATCGACG GTGACGTTGC TGAATGTGAC CGCCGGCGGC
GGCTCGACCA CACCAACGTT TCAGCTCTCT AAACTGAGCT AA
 
Protein sequence
MLEQAIRDYH ALLTPDLARA SQERLTNLQQ QQNLFFGTRP LCNVLRPHFL SVEQSSLIER 
VSQLVAEASR TVVEYALRTP EVLDLLALTE GEHQLISYEP GYRELSVSSR LDSFLTSDSS
SFQFVEYNAE SPAAIAYEDI LSQVFEQLPI MQEFQRHYRV ESLPARQRLL EAFLAVYREW
GGTGEPKIAI VDWHGLPTLS EFQLFQQYFA EHGLKTVICA PEDLRYQAGT LYANNTPVNF
VYKRLLTTEF LQRLGNEAFD HPLTQAYRDG AICLANNFRA KLLHKKMIFG LLSDPAITSA
AGISSATQQQ LAQHIPWTRR VTAGRTDYAG TEVDLLDFIR QNRDRLLLKP NDDYGGHGIT
IGWETEAEAW DLALQQALTE PFVVQERVVI AYEDYPAMVD GQLQIGQRLV DTDPFLFGSE
VQGCLTRLST VTLLNVTAGG GSTTPTFQLS KLS