Gene Haur_3909 details


Gene Information

Locus tag: Haur_3909
Symbol:
ID: 5735770
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4899285
End bp: 4901336
Gene Length: 2052 bp
Protein Length: 683 aa
Translation table: 11
GC content: 51%
IMG OID: 641281060
Product: hypothetical protein
Protein accession: YP_001546671
Protein GI: 159900424
COG category: [C] Energy production and conversion
COG ID: [COG0644] Dehydrogenases (flavoproteins)
TIGRFAM ID:
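
The coordinate and length fields above can be checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database tooling.

```python
# Values copied from the record above.
START_BP = 4899285
END_BP = 4901336


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the results agree with the Gene Length and Protein Length fields of the record.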


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACGTTT ACGCTCGTAT CGATATTGAA TGGTCGGCAG CACCAGCCCT CATCCACGCT 
AGCCTCGCTG CTCGACAAGG TTGGTTGTTG GGAGCTGTTA TCGCTGATCG CTGGCTTTGG
GCTGATCAAA CTTGGCAGAT TACTCCAAAC CATAGCTCGC AACGCTTGCA TTGGCAAGCA
CAGAGCATCA CGCAACCAAG CCTCAAAGCC GATTTGCTGC TCGAATTGCT GGATAGTGTT
TCCAGCACTC ACGTTCGGGC CACCTTGCAT GTTGAATGGC CTAAAACCAC GTTTAAATTA
TGGCGCTATT GGCAACGCCG TCGCTGGTTA CACAACGAAT TAACCAACGT TTTGCAACAT
TGGCGCGACG CTTGGACAAC CCAATCCACG CCAAGCGACC AACTTGCCCA AAGCCATCCG
CGCACATGGC AAGCTTTCCA CACCATGAAT GCGCTCGATC AACTCCAACG GATTCAAGCA
CTCGATCGAC GTTGGCAGCA ATTTGAGCAT GGCCAACTGC CCAACTTGCC CTATAGCCAG
ACCGATCAAT TGCCCAAGGC TGATCTGGCG GTCGATTTGG TCTATGCTGG CGGCGGCTTG
GGCTTGATTC ATGCCACGCT GATGGCTCGC AAAGGGCTAA ATGTGCTGGT GTTTGATCGG
CATCAGGTTG GTTGCGCTCA TCGCGAATGG AATATTTCGC AGGCTGAACT TGAACGCTTA
GTAGCAACTG GCTTTATCTC GTGGGAAATC CTCGAACGCC AGATTATTAT GGCGCGTTAC
CACGATGGGA TTGTGCGCTT TCACGCGGCT GGCTCTGCCG TTGCTCCTGC CGAATTACAC
TTACCCGAAG TATTAGATAT TGCGCTCGAT GCTGGTGCGT TGCTCGATTA TGCCCGCCAA
CAGTTTCTGG CTGCAGGCGG CATTATTTGG GATAACACCA GTTTTGAGCA TGTCTATCAC
GATCCAAAGC AGCAACAAAC TGTCGTAGCC GTGCACAAAG CAGATCAAAC GCAACTGATA
GCTGCCCGTT TGCTGATCGA TGCTATGGGT GCAACCTCGC CATTAACCCT GGCAACCCAA
CCATTTGCTG GCATCTGTCC AACCGTTGGC ACGGTGCTTA GTGGCGCGGA GCATGACCAA
AACTTGGGCG ATATTTTGAT CAGCATTGCC GATACCCAAG CTGATCGCCA ACTGATTTGG
GAAGGCTTTC CAGGCCGTGA GCATGAACTG ACGGTCTATG TGTTTTACTA TGATCAGGTT
GGAGCCAAAG CCAAATATCG CCATTCGCTG CTGGATTTAT TTGAAGATTA CTTTGAACTC
TTGCCAAGTT ATAAGCAGCT TCAAGCGAAT GCTCAGCATC TCCGTCCGGT TTTTGGCTAT
ATTCCAGCGC GTCATGCCTT GAACAAACCT AAACCTTTGG CAGGCGTTTT GGCTTTGGGC
GATGCCTCAG CCCAACAATC GCCGTTGACA TTTTGTGGCT TCGGCTCGTT TGTCCGCAAC
TTAAGCCGTA CCACCGATTT GCTCGAACAA GCACTTGAAC AAGCGTTGCT TGCCCCTCAG
CAGCTTAGTT TGATTAGCGC CTATCAAAGC AATGTCAGCA TGAATTGGGT ATTTAGCCGT
TTTATGACTC CATGGGGTCG CCCGCAAGAT GTCAATGAGC TGCAAAATGT CTTTGCTCAT
GTGCTCAATC GCTTGGGATA CGACCTCGCA CGGCGCTTTT TCCAAGATCA AATGACTTGG
CACGATTATA ATCGGGTCGT CTTAGGCACG CTGGCCTTCT ACCCACGGAT TATGCAAGTT
GCTTGGCAAG TGCTTGGTTG GCGCGATTGG CTGCGCTGGA TTGGCGATTG GCTGCGATTT
AGCCGCGCAG CATTCATTGC CCAGTTTGGT CAACAATTAC CAAGTTGGTT GGTTGGCCGC
TTGCCTAAAC CATGGCTTTT TCAGTACAAT GCAGCCTATG CCGAATGGCG AGCGATGGGT
TGGCTCAAAT CTAGCCCTGA GCATCAAAGC CAAGCGCTTG GTTCACAACC CTCAATCAAG
CAATTTGGCT AG
 
Protein sequence
MHVYARIDIE WSAAPALIHA SLAARQGWLL GAVIADRWLW ADQTWQITPN HSSQRLHWQA 
QSITQPSLKA DLLLELLDSV SSTHVRATLH VEWPKTTFKL WRYWQRRRWL HNELTNVLQH
WRDAWTTQST PSDQLAQSHP RTWQAFHTMN ALDQLQRIQA LDRRWQQFEH GQLPNLPYSQ
TDQLPKADLA VDLVYAGGGL GLIHATLMAR KGLNVLVFDR HQVGCAHREW NISQAELERL
VATGFISWEI LERQIIMARY HDGIVRFHAA GSAVAPAELH LPEVLDIALD AGALLDYARQ
QFLAAGGIIW DNTSFEHVYH DPKQQQTVVA VHKADQTQLI AARLLIDAMG ATSPLTLATQ
PFAGICPTVG TVLSGAEHDQ NLGDILISIA DTQADRQLIW EGFPGREHEL TVYVFYYDQV
GAKAKYRHSL LDLFEDYFEL LPSYKQLQAN AQHLRPVFGY IPARHALNKP KPLAGVLALG
DASAQQSPLT FCGFGSFVRN LSRTTDLLEQ ALEQALLAPQ QLSLISAYQS NVSMNWVFSR
FMTPWGRPQD VNELQNVFAH VLNRLGYDLA RRFFQDQMTW HDYNRVVLGT LAFYPRIMQV
AWQVLGWRDW LRWIGDWLRF SRAAFIAQFG QQLPSWLVGR LPKPWLFQYN AAYAEWRAMG
WLKSSPEHQS QALGSQPSIK QFG
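
The relationship between the two sequences above can be illustrated with a short translation sketch. It assumes the blocks of ten characters are purely a display convention, so whitespace can be stripped, and it builds the codon table from the compact NCBI-style amino acid string. Translation table 11 is understood here to assign the same amino acids to codons as the standard code and to differ only in which start codons are permitted. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
# Codon table built from the compact NCBI layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line and last 12 bases of the gene sequence, as displayed above.
first_line = "ATGCACGTTT ACGCTCGTAT CGATATTGAA TGGTCGGCAG CACCAGCCCT CATCCACGCT"
last_bases = "CAATTTGGCT AG"

print(translate(clean(first_line)))  # MHVYARIDIEWSAAPALIHA
print(translate(clean(last_bases)))  # QFG
```

The two printed strings correspond to the first 20 and last 3 residues of the protein sequence shown above. Passing the full cleaned gene sequence to `gc_percent` would give a value to compare with the GC content field.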