Gene Haur_0646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0646 
Symbol 
ID5732546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp744697 
End bp746262 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content42% 
IMG OID641277775 
Producthypothetical protein 
Protein accessionYP_001543422 
Protein GI159897175 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000786066 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTATCGT ACCATCGTGT CCGTATGCTC TTCGCATTGA TCCTGCTTTT CAGTTTATGC 
AGCACCGCTC CTGCGTCTTC TAGCCAAGCA ACATCCCAAG CACTTTCCCA TGCCGTTATT
GATGCTAGCG GGGTTCTTGA TAACTGGCAA TCACTTGCCA TAGGGTCAGA TGGATTAGGC
TTAATTGCGT ATTGGGATGA AACAGCTAAA GTACTCAAGG TTGCCCATTG TAATGATGTC
GCTTGCCGCA CCGCAACTAT TACTCCTCTT GTTACAACGA CTACCGAGTT CGGGAGTATT
GAACTTACCA TTGGAAGGGA TGGTTTAGGG AAAATTATCT ATAACGTGGG CGGGCAAATA
TTCTTAGCCT TTTGCCAAAA TATTAGTTGT ACGAGTATTC AGACTAAACC TATTCACAGT
GGTGGAAATA GCCAGCTTCT TATAGGAAGT GATAGTAACC CAATCATTTT TAGTACGACT
GGTTCTAGTC CTGACAATTA TCAGATCAAG GTATCACATT GTGATGATCC ACAATGTCAA
GCTATCACTA CAACAAAACT CACCGATACC TTAAAAACAT CTTCTCCTTT AGTTGCTGCT
ATCGGTAGTG ATGGATTTCC ACTTATTTTA TACTATGATA ATGATCTGCT TCAACACATG
CTTATTCACT GTTCCGATGT CGCTTGCCAA CAAACAACGA CTTCAGCGCT TCCTATAGCT
ACCTATCAAG CAAACACCGA TAGTGATATG ACCATAGGTA GTGATGGATT TCCAATCATT
ACCTATAAAA CTATTCAATT TGATCAATTA GGTATCATTC ACTGTGAAAA TATTCTTTGT
ACCACCTATA CAAATGTGTA TCATGATGAT TTTTGGCTGG GTACCCATCC ATCAGTTATT
ATCAATAGTG ATGGATTACC GCTCGTGAGT TACTCTGTTG ATCGTAATCA ACGCAGGGAT
CTTTATATTT CCCGTTGTCT TGACCTTGCC TGTAACGATA TAACGACCCA TATTTTAGAT
AAAACAACGT GGATAGGGAC GCAAGATATG GCGCTCGGAA ATGATGGCTT GGTGTTGATC
GCCTATCATG ATCTCTCTCG AGCACAATTA CGCGTTATCC ACTGTAAAGA TATTGCCTGT
AGTGAAGATG GGGTTAATCT TTTTGCTCAG TATGCACCGA TGCTGATGCA AACACCAACA
TCGATGGTTG TTCCGATCAA TGCAACAGCT ATTCCTGATC AAGCAGTCAC CACCCTTGGC
CAAGTCTTCT TTACGACGTC GATTTCAATT CCTCAGCCCA TTCCTAGCGG TGGGAGATAT
GTTCTTGCGG GGAATGCACA AGGTACAGCA CCAAGCATTG TGGATGATAA AGTGGTTCTC
AAAGCAGGAA ACCAAACGAT TTTTGAATTT GAGTATGGTG GTACGGGAAC TCCTAATCCT
ACTTTAGTTG AGGTTCCGAA TGCCATTATT GAATCACATA TTGGACAAAC CATTACTATA
GAATTCCGTG ATGTGTATGG TGGACGAATC CAAGCTTCAA CGATGTATCT GGTTTGGATT
CCCTAG
 
Protein sequence
MVSYHRVRML FALILLFSLC STAPASSSQA TSQALSHAVI DASGVLDNWQ SLAIGSDGLG 
LIAYWDETAK VLKVAHCNDV ACRTATITPL VTTTTEFGSI ELTIGRDGLG KIIYNVGGQI
FLAFCQNISC TSIQTKPIHS GGNSQLLIGS DSNPIIFSTT GSSPDNYQIK VSHCDDPQCQ
AITTTKLTDT LKTSSPLVAA IGSDGFPLIL YYDNDLLQHM LIHCSDVACQ QTTTSALPIA
TYQANTDSDM TIGSDGFPII TYKTIQFDQL GIIHCENILC TTYTNVYHDD FWLGTHPSVI
INSDGLPLVS YSVDRNQRRD LYISRCLDLA CNDITTHILD KTTWIGTQDM ALGNDGLVLI
AYHDLSRAQL RVIHCKDIAC SEDGVNLFAQ YAPMLMQTPT SMVVPINATA IPDQAVTTLG
QVFFTTSISI PQPIPSGGRY VLAGNAQGTA PSIVDDKVVL KAGNQTIFEF EYGGTGTPNP
TLVEVPNAII ESHIGQTITI EFRDVYGGRI QASTMYLVWI P