Gene Haur_1942 details


Gene Information

Locus tag: Haur_1942
Symbol:
ID: 5733831
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2353809
End bp: 2355551
Gene Length: 1743 bp
Protein Length: 580 aa
Translation table: 11
GC content: 52%
IMG OID: 641279086
Product: hypothetical protein
Protein accession: YP_001544713
Protein GI: 159898466
COG category:
COG ID:
TIGRFAM ID:
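
The Start bp, End bp, Gene Length and Protein Length values above agree with one another if the coordinates are read as 1-based and inclusive and the final codon is a stop codon that is not counted in the protein. A minimal Python sketch of that check, under those assumptions (the variable names are illustrative and not part of the record):

```python
# Values copied from the Gene Information fields above.
start_bp = 2353809
end_bp = 2355551

# Assumption: coordinates are 1-based and inclusive,
# so the span covers both the start and the end base.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is
# excluded from the reported protein length.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```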


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0873468
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAATTTG TTATTCAACA ATCGAAACTA AACGTAAAAA CTAGCTCGTT ATTCGTCTTT 
GGTTTGGCGT TATTGCTCTA TACCATAACC CTCTTACCAG GGCTTGGCGG TGGTGATACC
GCCGAATTTC AGCGCGTTGG CCCAACCCTT GAGGTCGCCC ATAGCACAGG CTACCCGCTT
TATAGCATCC TAACGTGGCT TTGGAGCAAG CTTATTCCAC TTGGGAGTAT TGCTTGGCGG
GTTAATCTCT TTTCGGCGTT CGTTGCCGCC CTCAGTTTAA GCAGCTTCAA CCAACTTGCC
CAAACGAGCG GCCTTGCCCA ACGTTGGGCC TTGGCTGGCA CTGCCATGCT TGGGCTATCG
TCAAGCTTTT GGCAACAGGC TACTCAAGCC GAAATTTATG CGTTTGCGGT GCTTTTGCAG
GTCAGCACGC TTGGGCTGAT CGTCGCTTGG TGGCAACAAC GCATACCGTT TTGGCTGCTT
GGGCTAGCGG CAGGCCTGAT GCTTGGGCAT CATCGCAGCA GCGTGTTTAT TCTGCCATGG
ATGCTGATTG CCTGTTGTTG GCAGCAACGC CCAAGTGTAC GGCAGTGGCT GTTGGCGGTT
GGGGCTGGCT GTTTAAGCTT TCTGCCCTAT TTGTATATCG TTTGGCGTGC CCCAGCTTGG
CAAAATGGTT GGGCATTGCT TACTGATTAT CTGCTTGGCA GTGCTGGCGG GGCATGGTTT
GATCCTCAAC GAGCGCTCGA TCAAGGCTGG CAACGGCTCT TCGAGGTGCA AATCCAGCTG
TTTGGGCCAC AATTAACCTG GCTGGGCTTG GGCTTAGCTG GGATTGGCTG GTGGCAGTGT
TGGCAAAAAC AACGGCCACT CGCAATAATG CTTGGTGGTA GCTATTGCAG CATTTTGGGC
TTTTGTTTGG TCTATTTCGT TGACGATCTG GCGGTGTTTG GGCTGGGAGC CTATGTTGCC
CAAGCCTTGT TGATTGGCTT TGGGTTGGCC GCGTTGCCAT GGCCACGCTG GGGGCTGGGC
TTGGCCTGCG GCATCAGTCT GTGGTTGGCT TGGCAGACAT GGCCACAAAT TCACCAATCC
AATACCGCCC AACCTGAACA ACTTGCCAGA CAACGCTTGG CCGAACCACT CGCTGCCAAT
AGCCTCGTGA TTGGTGATGG TTGGAGCATT GAGAGTTTGC GCTATCTGCA AACCGTTGAG
CAGCTACGGC CAGATCTCGA ATTTAGCTTT CAAGCCGAGC AGCAACGGAT TCGCGATCAA
TTAGCCCAAG GTCGCCAGAT CTATGCCTTG CAGGCGATCC CTGAATTGGG CTTGCAGCAT
CGTAAAGTTG GCGATTGGTG GCAGATCGAG AATATTCCTC AGCAGTTTAA GCAACAAGTC
GCTATTGTGT GGCAGAATGG TATTCAATTA ACTGGATTTG ATTTACCGGC TCAAGCTCAA
GATCAGCTGT TAATTGGGCT ACGTTGGCAA ACCAGCCAGC AATTACCGAG CTTAATTCGC
TTCATTCATG TGCTTGATCA AACTGGCCAG TTGGTTGCGC AAACTGATAG CCCAACCAAT
CCTAGCAGCG AATTTTGGCC CCTAGCCCAA GCCCAAACCG ACCTATTGGC GGTCAATTTG
CCGCAGTATT TGCCAGCAGG CAACTATAGC GTCATTGTCG GCTGGTACGA TTTGGGTGGG
CAGCGGATTG CGCTCACTCC TCAGCAAAAT ACCTATCTGC TTGGCATGGT TATGCTCGAT
TGA
 
Protein sequence
MQFVIQQSKL NVKTSSLFVF GLALLLYTIT LLPGLGGGDT AEFQRVGPTL EVAHSTGYPL 
YSILTWLWSK LIPLGSIAWR VNLFSAFVAA LSLSSFNQLA QTSGLAQRWA LAGTAMLGLS
SSFWQQATQA EIYAFAVLLQ VSTLGLIVAW WQQRIPFWLL GLAAGLMLGH HRSSVFILPW
MLIACCWQQR PSVRQWLLAV GAGCLSFLPY LYIVWRAPAW QNGWALLTDY LLGSAGGAWF
DPQRALDQGW QRLFEVQIQL FGPQLTWLGL GLAGIGWWQC WQKQRPLAIM LGGSYCSILG
FCLVYFVDDL AVFGLGAYVA QALLIGFGLA ALPWPRWGLG LACGISLWLA WQTWPQIHQS
NTAQPEQLAR QRLAEPLAAN SLVIGDGWSI ESLRYLQTVE QLRPDLEFSF QAEQQRIRDQ
LAQGRQIYAL QAIPELGLQH RKVGDWWQIE NIPQQFKQQV AIVWQNGIQL TGFDLPAQAQ
DQLLIGLRWQ TSQQLPSLIR FIHVLDQTGQ LVAQTDSPTN PSSEFWPLAQ AQTDLLAVNL
PQYLPAGNYS VIVGWYDLGG QRIALTPQQN TYLLGMVMLD
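
The protein sequence can be reproduced from the gene sequence by reading it three bases at a time. The sketch below is a minimal illustration, not the tool that generated this record. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard genetic code, and it stops at the first stop codon. The names `BASES`, `AMINO_ACIDS` and `translate` are hypothetical, and only the first 30 bases of the gene sequence are used as input.

```python
# Codon table in the conventional T, C, A, G ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGCAATTTG TTATTCAACA ATCGAAACTA"
print(translate(first_blocks))  # MQFVIQQSKL
```

The output matches the first 10-residue block of the protein sequence. Passing the full gene sequence, with its blocks joined, should give the complete protein under the same assumptions.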