Gene Haur_2707 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2707 
Symbol 
ID5734588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3460627 
End bp3462438 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content51% 
IMG OID641279850 
Producthypothetical protein 
Protein accessionYP_001545473 
Protein GI159899226 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0029913 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATACGAA CACATGCTTC AATCGAATGG CTGGCCAGTT TACCTACCAG CGAACGCCAT 
GTGCTGGCAC ATTTATGGCA AGTTGCCGAT GCTGCTCAGC TTGGGCAACC AAGTGCCATC
CAAGCCTTGG TTGCGCGTTT ATCCACGCCT GAACGCTCAG CGCTTGATCG GGTGATTGCG
GCTGGTGGTA AGCTCGCGGC CAAATCGCTT GAACGTGAGT TTGGCAAAAT TCGCTCACAT
CGGGATATTG TGACTCCTCG GGCCTATTTA TTGGCGCTGC ACGGCCAAGC TTCGGTGCTT
GAACGTCTCT ATATTTTGGG CTTGCTTCAG CCACTCAAAA CGCTGGACGG CGAGTATTAC
GTGATTTTTA GCGATTGGTT GCAGGCGTTG CCAGCAGTTT CGGCCCCCAG TTTGCCAACC
TGGCAGCAAC ATCCCAGCCC AAGCAACTGG ATCGAAGCAG ATCTCAATCA AACCGAAACG
CTGCTGACTA CGATCTTGGC GCTGTGTTAT CAGCAACCGC TCCAGCTGAC CCGCCAACTT
CAACTTGAAC GCGATGGTTT GAAAGCAATC TGTCAACGCA TAGCAGTTTC TACGCCAGCA
AGCGAACGCC AATTTCCTCA ATTAGCCTGG CTGCGAAATT TGGCGCTTGA GGCTGGCTTA
TTACACATCC AACATCAACA GCTTCAGCTC GCTGGCAACC CAATTAATTG GCTAGAAGCT
ACGCCCAAAC AGCGGTTAGA GCGCTTATTT AATGCCTGGT TGGTTTGTGA TTTTGATGAG
TTTAGCTTGA CTGAGTTGCA GCCTCAGACT CCATTTACCC TCCAAGCTGC TCGCCAAGCC
TTATGGCAAG TATTAACAAC CGCGCCACCC GATCAATGGT TGGCCTTCGA CGATCTACTG
GCACAAATCC AAGCCTTACA TCCCGAACTG TTGCGCAGCG ATTTCGAGCA GCCCGTAATT
CACAATCAAT CCAATGATTC ATTCGTTGGT TGGCAACATT GGGCCAAAGT TGAGGGCGCA
TGGATCAAAG CCGCCTGCCA AGGGCCATTG TTCTGGCTCG GCTTGCTTGA TGTCGATCAA
CTTAACCATC CACAGGCTTT GCGTTTAACT CAATGGGCAA GCTGCTTACT CGATCCAGCG
CACGAGCCAA GTCAATTTGC TGGGCAACTA CAACTAAGCA GCGATGGCCT GATTCGGGTT
CCACCAACGG TTGAGCCGCT GCCGCGCTTT CAAATCCAAC GCATCACCGA ATGGCAATCA
ACCGACAGCC ATGGCACCAT GCTCGTGCGC TTGACCGCCC ATTCGTATAG CCAAGCCTTG
CAACGTGGCA TTCAGGCCAG CCAAATGCAA ACATTTTTGC AACGCTGGTG TGACCGACCA
GTGCCAAACG ATTTGCAAAG CTTATTTCAG CAATGGCAAA ACGATCGCCA GCACTTATTG
GCTCGTCCGG CTTTATTGCT GGAAGCCGAT GATCCCAGAT TACTCAACGA GCTGGCTAAA
CTGCCTAACT TACCACCCTA CGCCGAGCTT AATCCCCAAC TTTGGGAATT GGAAATAGCT
GATAGTGCCG CATTAACCAA TCTGTTGCAT ACAGCAGGCT ATGCAATCAA CCCAGTCAGC
GAGCCAGATC AACGGATCAG TGACCATGAT CTTAAACAGT TGATTACGGC CTTATTGACG
GTTCAGCGTT TAGCGCCAAC TGTGGTCAGC CAAGCAGTGA TTGAGCGGGT GGTGCAGGCC
TTGCCCAGCA GCGAACGCCA ACAGCTCACA GCCAACGTCA ACCAATGGCT ATCAATCATT
AATCGAAGCT AG
 
Protein sequence
MIRTHASIEW LASLPTSERH VLAHLWQVAD AAQLGQPSAI QALVARLSTP ERSALDRVIA 
AGGKLAAKSL EREFGKIRSH RDIVTPRAYL LALHGQASVL ERLYILGLLQ PLKTLDGEYY
VIFSDWLQAL PAVSAPSLPT WQQHPSPSNW IEADLNQTET LLTTILALCY QQPLQLTRQL
QLERDGLKAI CQRIAVSTPA SERQFPQLAW LRNLALEAGL LHIQHQQLQL AGNPINWLEA
TPKQRLERLF NAWLVCDFDE FSLTELQPQT PFTLQAARQA LWQVLTTAPP DQWLAFDDLL
AQIQALHPEL LRSDFEQPVI HNQSNDSFVG WQHWAKVEGA WIKAACQGPL FWLGLLDVDQ
LNHPQALRLT QWASCLLDPA HEPSQFAGQL QLSSDGLIRV PPTVEPLPRF QIQRITEWQS
TDSHGTMLVR LTAHSYSQAL QRGIQASQMQ TFLQRWCDRP VPNDLQSLFQ QWQNDRQHLL
ARPALLLEAD DPRLLNELAK LPNLPPYAEL NPQLWELEIA DSAALTNLLH TAGYAINPVS
EPDQRISDHD LKQLITALLT VQRLAPTVVS QAVIERVVQA LPSSERQQLT ANVNQWLSII
NRS