Gene Haur_4060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4060 
Symbol 
ID5735918 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5184765 
End bp5185979 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content47% 
IMG OID641281211 
Producthypothetical protein 
Protein accessionYP_001546820 
Protein GI159900573 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000137982 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAGCCAA AAACACCAGA ACCAATCATC AAAACAATCC CATCTTGGCG CGATTTAGCC 
CGTTGGACAA TCACCGCATT TGCGATTTGG CTGGTGGCGT GGTTGCTTTG GCGCACGGGA
AATCAGCTCT TGCCGTTTGT AGTTGGTTTG GTGTTTGCCT ATTTGCTCTT GCCCTTGGTC
AACAAGTTAG AGCGCTGGAT TCCGCGCTGG GCCGCGATTT TGGTGGTCTA TATAGTTGGC
TTGGGAATTG TAACAGGCTC GATTCTCTAT ATTGTGCCGC CTGCAATCGA CCAAGTGAAT
GGGTTTGGTA AATCATTGCC TGAATTTTAT AAAAACACCC TCGAACCCAA AATCAATGAA
GGTCTAAAGT GGTATCGGAG CGAAGTGCCC GACGAGATTC AAGAAGATAT TGATAAGCAA
GTGAGTAAAG GCATCACTAC ACTCAAAGAA AATGCTACTA ATTATGTTGA AACAGGCGTG
AATGGAATTT TGAATGGCTT GGGGGTGATT TTTCAAACAA TTATCTTCCT CGCAGGCTTT
TTGATTATTC CATTTTGGCT GTTTTATGTG CTGCTTGATG AACGCAAAGG CAAGGCAGCC
CTGATTCGCA TGATTCCCAA AGCGGTGCGA ACCGATGTAT TGACCGTGCT ATCGATTTTT
GATCGGGTGT TTTCGGCTTA TATTCGGGGC CAATTAACGC TTGGCTTGAT TATCGCAATT
ATGTCGTACA TTGGCTTGTG GATTGTTGAT TTGGTGATGC CTGGCGAAAT TCCCTATAAA
TTGCTGTTGG CCTTGGTTGC AGGCTTCACC GAATTAATTC CGGTGATTGG GCCGATTATT
GGGGCAATTC CGGCGGTAAT TGTTGGCTTA ACCACCTCGT TGCCAATGGG CTTAGTCGTC
GCTGGCTTGT ATATTGTGAT TCAGCAAATT GAAAATAATT TCCTTGTGCC ACGGATTATC
GGGGCAATTG TGGAAATTCA TGAAGCAGTT TTGATGCTGC TGTTGGTGAT TGCTGGTACA
GTTTCGGGCT TGCTTGGGGT AATTATTTTC GCCCCGATGG CGGCAGTGGC CCGCGATAGC
TACCAATATA TCACTGGTCG CTTGCGCCAA CCCAACGATC CACGCTATTT GCGAGCTGGC
GAGTTGCCGT GGGAACATAA AGAAGAACCT GAAACGCCGA TGCCGCCGAT GTTGGCTTTG
CAAAATAAAG CCTGA
 
Protein sequence
MEPKTPEPII KTIPSWRDLA RWTITAFAIW LVAWLLWRTG NQLLPFVVGL VFAYLLLPLV 
NKLERWIPRW AAILVVYIVG LGIVTGSILY IVPPAIDQVN GFGKSLPEFY KNTLEPKINE
GLKWYRSEVP DEIQEDIDKQ VSKGITTLKE NATNYVETGV NGILNGLGVI FQTIIFLAGF
LIIPFWLFYV LLDERKGKAA LIRMIPKAVR TDVLTVLSIF DRVFSAYIRG QLTLGLIIAI
MSYIGLWIVD LVMPGEIPYK LLLALVAGFT ELIPVIGPII GAIPAVIVGL TTSLPMGLVV
AGLYIVIQQI ENNFLVPRII GAIVEIHEAV LMLLLVIAGT VSGLLGVIIF APMAAVARDS
YQYITGRLRQ PNDPRYLRAG ELPWEHKEEP ETPMPPMLAL QNKA