Gene Haur_2192 details


Gene Information

Locus tag: Haur_2192
Symbol:
ID: 5734079
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2776447
End bp: 2778759
Gene Length: 2313 bp
Protein Length: 770 aa
Translation table: 11
GC content: 53%
IMG OID: 641279333
Product: hypothetical protein
Protein accession: YP_001544960
Protein GI: 159898713
COG category:
COG ID:
TIGRFAM ID:
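
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; both assumptions are consistent with the listed values but are not stated in the record itself.

```python
# Values copied from the Gene Information fields above.
start_bp = 2776447
end_bp = 2778759

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not
# counted as an amino acid in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2313 770
```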


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000184948
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTCGTA AACGCTATCT TATGCTAATC GCCCTCGTTT TAGCCAGCAC ATGGCTGCTA 
CCGGTTGCTG CCCAAGATCA AACCAAGCAG CAACCATCGG CAGTTTTAGC ACCTTCAGGC
GAAACAATTA CCTACCAAGG CATGTTACGC GAAAGTGGCA GCTTGGCAAA TGGTGTGTTC
GATTTTCAAT TTGGTTTGTA CGTTGACGCA GTCGGTGGCA CAGCTTTAGG CGTAGTCACC
CGCAATGATG TGACTGTGAG CAATGGTTTA TTTACTAGCG AACTCACCTT TAGCGAAGGC
TTGTTCAATG GTGAAAGTCG CTGGATCGAG CTAGCAGTTA AGGCCGATGC TAGCGGCAGC
TATACCACGC TCACGCCCCG CCAACAAGTG ACAGCCGCGC CCTTGGCTTT AGCCTTGCCA
GGCTTCTGGA CACGCCAAAA CAGCTTTAGC CCCAACTTAA TTGGCGGCTA CGAAGGCAAT
ACTGCCTCAT CGTTGGCGAT TGGGATCACA ATTAATGGTG GTGGCAACTC AGGTGGCCTG
AATAGCGCCT ACGATAACTA TAGTTCAATT GGTGGTGGGG CTGGCAATAG CGTCGGCAGC
AACGATGGTA GCCCAAGCAA CGATGTCTAT TCAACCATCG GTGGGGGCAT CAACAATACC
GCCAGCCAAG AATACATTGT GATTGGCGGT GGTCAAACCA ACAACGTCAG CGGCGCTTGG
TCAACGATCG GTGGTGGCAT CAACAATGTG ATTAATAATA GTCGCTATAG TGTAATTGCT
GGTGGTGGTG GCACAACTGG CAACACAATT TACGATGATT ATGGCACAAT TGCTGGGGGT
AGCGAAAATA TCGCAGGTTT AACTGGCGAG GCCGTCAGCC AAATGTATGC CACTGTTGGC
GGTGGTCGGG CCAATGTTGC CAGCGATAAC TATGCGATCG TCAGTGGTGG CCGCAGCAAC
ACTGCCAGCG ATGATTACTC AACAGTTGCT GGTGGCTACA ACAACAGTGC TGCTGCTCAA
TATGCCACAA TCAGCGGCGG TGGCAATTCC AACAGTGCCC AAGCCAATCG TGTTTACGAC
GATTATGGCA CAATCGGCGG TGGTACAAAT AACGTAGCGG GCGTAACTGG CGATACAACT
GGTCAGCAAT ATGCCACGGT TGGTGGTGGC AATGGCAATA CCGCCAGCGA AGATGGTGCG
ACGGTTGCTG GCGGCAGTGG CAATAGCGCT AGCGCCAACT ACACCACAGT TGCTGGTGGC
ACTACTAATG CTGCCAGTGG CGATACCAGC AGCATCGGTG GTGGCCAACT GAATGCAGCC
AGCGGTGGCT ATTCAGTAGT TGCTGGTGGA CGCGGCAATA CTGCTTCTTC CAATATCGCC
GGAGTTGGTG GTGGCCAATC GAATCAAGCG ACCAACACTG GCGCATATGT CGGGGGCGGC
CAGACCAATA CAGCGACTGG CCAATATTCG GTCGTGGCTG GTGGTGTCAA TAATGATGCC
ACTAATACCT ACGCTTCCGT AGTTGGTGGT ATCAACAATC AAGCGGCAGG TGCGGGTACA
TTTGTCGGCG GTGGGCAAAA CAATAATGCC AACAGCACCT TATCGGCAAT TCTGGGTGGT
AGCGGCAATA CAACCTTGGC CGATTATACC GTTGCCGCTG GCGAAAATGC CGTCGCTGCT
CACGCTGGGA GTTTTGTTTG GGCTGGTCAA CAAGCCAACA AAGATGATAG TATTTCGACG
ACTGGGCCAG GTCAGTTTAT CGTGCGTGCT CCAGGTGGGG CGTGGTTTGG CAGCAGCACC
AAGGTCGATA TGCCTGATGG CGCGATTTTG GCAACTGAAA GTGGCGCTTT CCTGAGCAAA
GGCGGTACAT GGGCCAATTC ATCAGATAAA AATCTCAAAT CAAATTTCGC CACGATCGAT
CCCCAAGCTG TGCTCGATCA GCTTGCCAGT ATTCCCGTAC AAGCTTGGAG CTACAACAGC
GAAGGTGCAG CAGTTCGCCA TATTGGCCCA ACCGCCCAAG ATTTTTATGC CGCCTTTGGT
TTAGGCACTG ATGATCGCCA CATTGCCACG GTCGATGCTG ATGGCGTGGC CCTCGCTGGA
GTCCAAGGTT TATACAATTT AGCCACCGAA CAAGCTCAAT TACTCGATCA ACAAGCCGAG
CACATGGCGG CACTTGATGC ACGACTCGCA GCCTTGGAAC ATAGCCAAAA TCCTCAAACC
AGTCTCCCAT GGTTGTGGTT GATCGCGATT GCCGCAGTTG GATTGGGCTT GGGGTGGATG
CTTGGTCGTC GCAGCAAGGG GCAACGCGCA TGA
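
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before its length or GC content can be checked against the fields in the Gene Information section. The following is a minimal sketch of that cleanup; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here as a small example.

```python
def clean_sequence(block_text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(block_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above, used as a short example input.
first_line = "ATGATTCGTA AACGCTATCT TATGCTAATC GCCCTCGTTT TAGCCAGCAC ATGGCTGCTA"
dna = clean_sequence(first_line)
print(len(dna), gc_fraction(dna))
```

Passing the full gene sequence text to `clean_sequence` instead of `first_line` would give values comparable to the Gene Length and GC content fields.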
 
Protein sequence
MIRKRYLMLI ALVLASTWLL PVAAQDQTKQ QPSAVLAPSG ETITYQGMLR ESGSLANGVF 
DFQFGLYVDA VGGTALGVVT RNDVTVSNGL FTSELTFSEG LFNGESRWIE LAVKADASGS
YTTLTPRQQV TAAPLALALP GFWTRQNSFS PNLIGGYEGN TASSLAIGIT INGGGNSGGL
NSAYDNYSSI GGGAGNSVGS NDGSPSNDVY STIGGGINNT ASQEYIVIGG GQTNNVSGAW
STIGGGINNV INNSRYSVIA GGGGTTGNTI YDDYGTIAGG SENIAGLTGE AVSQMYATVG
GGRANVASDN YAIVSGGRSN TASDDYSTVA GGYNNSAAAQ YATISGGGNS NSAQANRVYD
DYGTIGGGTN NVAGVTGDTT GQQYATVGGG NGNTASEDGA TVAGGSGNSA SANYTTVAGG
TTNAASGDTS SIGGGQLNAA SGGYSVVAGG RGNTASSNIA GVGGGQSNQA TNTGAYVGGG
QTNTATGQYS VVAGGVNNDA TNTYASVVGG INNQAAGAGT FVGGGQNNNA NSTLSAILGG
SGNTTLADYT VAAGENAVAA HAGSFVWAGQ QANKDDSIST TGPGQFIVRA PGGAWFGSST
KVDMPDGAIL ATESGAFLSK GGTWANSSDK NLKSNFATID PQAVLDQLAS IPVQAWSYNS
EGAAVRHIGP TAQDFYAAFG LGTDDRHIAT VDADGVALAG VQGLYNLATE QAQLLDQQAE
HMAALDARLA ALEHSQNPQT SLPWLWLIAI AAVGLGLGWM LGRRSKGQRA