Gene Haur_4998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4998 
Symbol 
ID5736834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6340261 
End bp6345663 
Gene Length5403 bp 
Protein Length1800 aa 
Translation table11 
GC content52% 
IMG OID641282165 
ProductNa-Ca exchanger/integrin-beta4 
Protein accessionYP_001547756 
Protein GI159901509 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.329522 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCCAAT CCCCTGATCA AGGGCGAGCC TTGCGCATGC AGGTTCGTCG TCGTGCATTG 
GTGCTTAATT TGGCACTATT GTTGGCATTA TTTGTTGTTC CGCTTCAACC AACAGCCGCA
CGCATTGCAG CACCACAGCT TCCTGAAGCC AGTTCTGCGA GTGCTCCAAG CGCTGCTTGT
ACGGTTACCA ACACCAACGA TGCTGTTTCG CCGCCAGTTG ACTCGTTGCG GGCAGCCTTG
GCGAATGCAG CATGTTCGCC AATCCTATTC GACCCAAGTT TGGCGGGCCA AACAATCACG
TTAACTGCTG GCGAATTGAC CGTTGGACGC TTGGTGGTGA TCGATGGAGC CGCCGCGCCT
GGCCTGAAAA TCGATGGCAA CAACGCCTCA CGGATTTTCA ATGTTAATGT CACGTCAGCC
AATCTCTTAA TTTACAACTT GACGTTGCAA CGTGGTAAAG CGCCCAACTC GGGCATTACC
AACGGCGGTG CGGTGCTTAA CCAAGGTGGC ACGGTTGTTA TCAGCAACAC TGCTGTACTT
TCGAGCACTG CTGGCACCAG TGGTAACGGC GGGGGCTTGC GCAACCAACG TACCAGCGGG
ATCGCCGGTG TGCTCAGGGT CTACAATAGT GTGATTCGCG GCAATACTGC GCCAAATAAT
GGCGGTGGAT TTAGCACCAG CGGCAGTATC ACCACTATCG TCAATAGCGC TGTGTTGAGC
AATACCGTCA CCAGCACTTC GGGTGGGGTG ACAGGTAGTG GTGCATTGGG ATTGGGCGCA
GGCATTCACC ACGTTAATAT TCAAAATCCT GCGGCTAACC TAACCACGGC GATCACCAAT
ACCACGATTG CTTATAACCG TGGCAAGGCT GGGGCAGCAA TCTACAACGA TAATAATGGT
ATGATCGATG TTTACTACAG CACGATCGCC TACAACGCTG CCTCGAATAG CAGTGGTTCG
GCGGTTGGTG GAGTAATCAA TGGGGTTACT ACCGCCGGCC AGCCGGCTTT ACGCCTCAGT
TTGTATAACA CGATTGTTGC CAATAATACT CAACGCGCTG CCAATACCAC CCTTACCAGC
GCTGATCTTG GGGTTGGCAC TGGTCAAACA ATTACTTCAA ATGGCTACAA TTTGATCGAA
ACAGTGCCAA GTGGCGCGGT CTTTGGCGGC ACAACTGCCA CCAACATCAC TGGCCAAGAT
CCAGTGTTAG GCGATGTGCT GAATAATGGT GGTGGTACGC CAACAGCAGC ATTGCTGCAA
AATAGCCCTG CCTACAACAC TGGCGATCCT GCGCCTATTG CAGCCCCAAC GACCGATCAG
CGTGGGCCTG GCTTTGTGCG AGTGCGCGAA GGCCGTTTGG ATATTGGCGC ATTTGAAGCA
ACCAACAGCA TCCAAACATC GCCAATTTTG GTTAGCACCA ATTTGGATAC TAATGATGGA
GCTTGTACAA CGTTCGATTG TTCGCTGCGT GAAGCGATCA CTTTAGCCAA TTCAACCCCG
CAAACCGATA CAATTCGCTT TAATTTGCTT GGCAGCGGTG TGCGCACCAT CTCGCCAACG
ACGGCCTTGC CAGCGATTAG TGCTCCGGTG ATTCTCGATG GTTTGAGCCA AGCGGGCGCT
GCCTGTAATG CCCCATTGAT CGAATTAAAT GGTTCGTTGG CAGGAGCAAG TGCCAATGGC
TTGGACGTAA CAGGCGGCAG TAGCCTGATT CGTGGGTTGG TGATCAATCG TTTTGCTGGC
AACGGCGTGC GTTTGGCAAC CCTCGGCGGC AATACGCTCG AATGTAATTT CATCGGCAGC
GATGCTTCAG GTACGGTCGA TGCTGGCAAT AGCTTATCGG GAGTGCGAGT CGAATCTGCC
AACAACTTAA TTGGCGGCAG CGGCGGTTCG AATGTGCGCA ACTTGATTTC GGGCAACGAT
GGCGATGGCA TCACGCTGGT GGCTGGCGCG AATGCAAACA CCATCCAAAA TAACTATATT
GGTACGATGC TGAGTAGCAA CGCAGCGCTT GGCAATAGCA AAAGCGGCGT GCGCATCGCC
AGCGATAATA ATCAAATTGG TGGCACATTT GGCACATTGG GCAACGTCAT TTCGGGTAAC
TTCGAACATG GTGTGTTGGT CAATGTGACA GCTGCACCGT TGGGTAACCG CATTCAAGGT
AACTTTATTG GTACGAACAT TGGGGCAACC CTCGATGTGG GCAATACCCA GCGCGGGGTC
TTTATCGACC GCACCGCCAA TACCTTGGTT GGTGGCACGA CGGCTGAGCG CAACGTCATT
TCGGGCAACA ACAGCGATGG CGTGGCAATT ATCGAAGCTG GCGCTACCAA CAACAAAATT
TCGGGCAACT ACATCGGTAC CAACAACGGT GGTGGCTCAG CCTTGGGCAA TAGCTCGGCT
GGGGTCTATA TTTTCAATGT TGGTGGTAAC ACGATCGGCG GTACGACCAC TGCCGAACGC
AACTTGATTT CAGCCAATAG CAACGGGATT GTGATTGGCG GCGGCTCGGC GATTGGTAAT
ATCATCAAGG GCAATTACAT TGGCACAACC CTGAATGGCA ACGGAGCCTT AGGCAATGTC
AGCGATGGCA TCAAGCTCGA CGGTGCACGT AACACGCAAA TTGGTGGTAC AACCATCGGC
GAGCGCAACG TGATTTCAGG CAATGGCAAT AATGGGATTA ATATTTTCAC CTCAAGCTCG
TCGGGCAATG TGATTCAAGG CAATTACATC GGCACGAACG CCTTTATTCC GCCAGCTAGC
CCAACTGGGG TCGCCAACCA ATTCAATGGC GTGCGGGTCG CGGCTGGCAC GAACACAACC
ATCGGCGGCT CGGCTCAAGG TGCTGGCAAC GTGATTGCCT TCAATGCCAA AAATGGGGTG
CTGATTACCC AAGGCGCAGC GGTTGCCACG ACCCAACGGA TTCAGCGCAA TAGCATTTTC
AGCAACAATC GCTTGGGCAT CGATCTTGGC CCCGACAATA ATGCTAACGG CGATGGCGTA
ACTGCCAATG ATGCCAACGA TAGCGATACT GGGCCAAACA ATCTGCTCAA CTATCCAATT
GTTGAATCGA CCAGCTCGGG TGGGGGCTTA ACCCAAATTA TTGGCTCGTA TATTGGCGCA
ACCAATACGG CCTTGACCCT CGATTTCTAT AGCCAAGGCG CGTGCGATCC AAGCAACTTT
GGCGAAGGTC AAAGCTACAT TGGCTCGTTT GCAATCAACA CAGGCCCAAC CGGAGCAATT
TCTTACACCG CCAATTTGAC CACGACTGTG GCACTGGGCC AACGGGTAAC CGCAACCGCC
GTCGATGGCA GCGGTAACAC CTCGGAATTC TCACGCTGTT CATCGGTAGG CAACTTGCCC
AGCGTGAGCG TCAACGATGT AACATTAACC GAAGGCAACA GCGGCATTAC CCAATTCAAC
TTTACCCTGA GCTTATCGGC AGTTAGTCCT AATCCAGTTG TCGTCGATTA TGTAACGGCT
GACGATACAG CGATTGCGCC CAATGATTAC ACTGCAGCAA GCACAACCGC CACCATTCCA
GCCAATACCT TGAGTATTCC GGTGACGATT GACGTTCACG GCGATACGCA ATTTGAACTC
AACGAACGCT TCTTTATCAA TTTGTTGAGT GCCACCAACG CGGCAATCGG CGATGGCCAA
GGCATTGGCA CAATTACCAA CGACGACACT GCTCCAAGCA TCAGCGTGAA TGAAGCAAGT
GCGAATGAAG GCAGCGGCGC ACCAGGCCAA GTCAATGTGC CAGTCACGCT CTCGAATGCC
AGCTATTTGC CAATTAGTGT TGAATTTATC ACAACTGATG GCACTGCCAA CGAATCCAGC
GATTACACTA CGATCACTGG CACACTCAAT TTTGCTCCAG GCGAAACCAG CAAAACAATT
GCAATTGCCG TGGTTGGCGA TTTGATCGAC GAGCCAGACG AAACGATCAA CGTGGCCTTG
AGCAATGCCG TCAACACCAC AATTAGCAAT GATGAAGCAG TTGCCACGAT TCTCGACGAC
GATGCAACGC CTGTGCTCAA TGTGGCCAAT GGCGAAGTAA CCGAAGGCAA TAGCGGCAGC
GTAACCGTCA CGCTCAATAT CAGTTTGAGT AACCCTAGCA GTAGCGCAAT CACGGTTGAT
TATCTGACTG TGGGAGGCTC GGCGAGTGCT GGCAGCGATT TTGCGATCAG CAGTGGTCCA
TTAAGCTTTG CCGCTGGTGA AACTAGTAAA ACCGTCGATG TGTTTATCCA AGGCGATATG
GTTGATGAAA GCGATGAGTC GTTCACGCTT GAATTGAGCA ACCCAGTCAA TGCAACGGTT
GGGAGTGCTG GTACGGTGGT TATTCTCGAC GATGATGCTG CTCCAATAAT CAACTTTGTT
GCGCCGCCTA TGCAAGAAGG CAATAGTGGC ACGCGACCAT TGACGGTTAG CTTGATGCTC
TCGGCAATCA GCGGCCAAGC AATTAGTGTC CAGTATGCGA CCCACGACAA TACAGCGCTT
GGCGGCAGCG ATTACACCAC GATCAGCGGC ACCTTGACGA TTCCGGCAGG CCAGCTAACC
CAAAGATTTA TTGTCAATGT CAATGGTGAT CTGATCGATG AAAGCAACGA AACCTTCACC
GTGACGCTGA GCAATCCGCA AAATGCTAGC TTGTTGGCTC CTGATGCAGT TGCAACAATC
ATCGACGACG ATGGTGCGCC AACGATCAAT GCTGATGCCA TTAACGTTAC CGAAGGCGAT
GCCAACAGCG TTAACGCCGT ATTCAACGTC AGCCTATCCA ACCCTAGCGC TACCGCTGTA
ACCGTGAACT ATTCAACTTT AGCTGGAACG GCAGCCAGTG GCACAGACTT TACACCCGCC
GCTGGTACAT TAAGTTTTGC GCCTGGTGAA CTTAGCAAAA CCGTGACCGT GGTTGTGCTT
GGCGATCTCG TGGATGAGGC TGATGAGACC TTTACCCTCG TTCTGAGTAG CGCGACAGGT
GGCGCAACGC TTGGCTCAAA CGGCACAGCC ACCATTGTTG ATAACGACCC AACCCCAAGC
GCTAGCTTGA GTGGTCTCAA CAGCGTGGTT GAAGGCTCTG GCATCACCAC AACCTTACAA
TTTACCGTGA CGCTTTCGGC GGCGAGTGGG CGGGCTAGCA GCCTACAATT TGCCACGACA
GCGGGCACGG CCAGCGCTGG TAGTGATTTT GTCGGCCAAA ATTTGATGCT CAATTTTGCT
GCTGGCGAAA CCCAAAAGGT CGTCAGTGTG GCAATTGTCG GCGATCGGGT GAAGGAGCCA
AACGAAAGCT TTAGCGTTGC CATCAGCAAT CCCAGCAATC TCACGCTTGG CACAGCTACC
ATCAGCGTCA CAATTGTTGA TGATGATGAA TGGCTGGTGT ATATGCCGTA TGTAGTGAAA
TAG
 
Protein sequence
MLQSPDQGRA LRMQVRRRAL VLNLALLLAL FVVPLQPTAA RIAAPQLPEA SSASAPSAAC 
TVTNTNDAVS PPVDSLRAAL ANAACSPILF DPSLAGQTIT LTAGELTVGR LVVIDGAAAP
GLKIDGNNAS RIFNVNVTSA NLLIYNLTLQ RGKAPNSGIT NGGAVLNQGG TVVISNTAVL
SSTAGTSGNG GGLRNQRTSG IAGVLRVYNS VIRGNTAPNN GGGFSTSGSI TTIVNSAVLS
NTVTSTSGGV TGSGALGLGA GIHHVNIQNP AANLTTAITN TTIAYNRGKA GAAIYNDNNG
MIDVYYSTIA YNAASNSSGS AVGGVINGVT TAGQPALRLS LYNTIVANNT QRAANTTLTS
ADLGVGTGQT ITSNGYNLIE TVPSGAVFGG TTATNITGQD PVLGDVLNNG GGTPTAALLQ
NSPAYNTGDP APIAAPTTDQ RGPGFVRVRE GRLDIGAFEA TNSIQTSPIL VSTNLDTNDG
ACTTFDCSLR EAITLANSTP QTDTIRFNLL GSGVRTISPT TALPAISAPV ILDGLSQAGA
ACNAPLIELN GSLAGASANG LDVTGGSSLI RGLVINRFAG NGVRLATLGG NTLECNFIGS
DASGTVDAGN SLSGVRVESA NNLIGGSGGS NVRNLISGND GDGITLVAGA NANTIQNNYI
GTMLSSNAAL GNSKSGVRIA SDNNQIGGTF GTLGNVISGN FEHGVLVNVT AAPLGNRIQG
NFIGTNIGAT LDVGNTQRGV FIDRTANTLV GGTTAERNVI SGNNSDGVAI IEAGATNNKI
SGNYIGTNNG GGSALGNSSA GVYIFNVGGN TIGGTTTAER NLISANSNGI VIGGGSAIGN
IIKGNYIGTT LNGNGALGNV SDGIKLDGAR NTQIGGTTIG ERNVISGNGN NGINIFTSSS
SGNVIQGNYI GTNAFIPPAS PTGVANQFNG VRVAAGTNTT IGGSAQGAGN VIAFNAKNGV
LITQGAAVAT TQRIQRNSIF SNNRLGIDLG PDNNANGDGV TANDANDSDT GPNNLLNYPI
VESTSSGGGL TQIIGSYIGA TNTALTLDFY SQGACDPSNF GEGQSYIGSF AINTGPTGAI
SYTANLTTTV ALGQRVTATA VDGSGNTSEF SRCSSVGNLP SVSVNDVTLT EGNSGITQFN
FTLSLSAVSP NPVVVDYVTA DDTAIAPNDY TAASTTATIP ANTLSIPVTI DVHGDTQFEL
NERFFINLLS ATNAAIGDGQ GIGTITNDDT APSISVNEAS ANEGSGAPGQ VNVPVTLSNA
SYLPISVEFI TTDGTANESS DYTTITGTLN FAPGETSKTI AIAVVGDLID EPDETINVAL
SNAVNTTISN DEAVATILDD DATPVLNVAN GEVTEGNSGS VTVTLNISLS NPSSSAITVD
YLTVGGSASA GSDFAISSGP LSFAAGETSK TVDVFIQGDM VDESDESFTL ELSNPVNATV
GSAGTVVILD DDAAPIINFV APPMQEGNSG TRPLTVSLML SAISGQAISV QYATHDNTAL
GGSDYTTISG TLTIPAGQLT QRFIVNVNGD LIDESNETFT VTLSNPQNAS LLAPDAVATI
IDDDGAPTIN ADAINVTEGD ANSVNAVFNV SLSNPSATAV TVNYSTLAGT AASGTDFTPA
AGTLSFAPGE LSKTVTVVVL GDLVDEADET FTLVLSSATG GATLGSNGTA TIVDNDPTPS
ASLSGLNSVV EGSGITTTLQ FTVTLSAASG RASSLQFATT AGTASAGSDF VGQNLMLNFA
AGETQKVVSV AIVGDRVKEP NESFSVAISN PSNLTLGTAT ISVTIVDDDE WLVYMPYVVK