Gene Haur_4095 details

Gene Information

Locus tag: Haur_4095
Symbol:
ID: 5735954
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5226737
End bp: 5228713
Gene Length: 1977 bp
Protein Length: 658 aa
Translation table: 11
GC content: 52%
IMG OID: 641281247
Product: membrane protein-like protein
Protein accession: YP_001546855
Protein GI: 159900608
COG category: [S] Function unknown
COG ID: [COG5305] Predicted membrane protein
TIGRFAM ID:
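
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only; it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 5226737
end_bp = 5228713
gene_length_bp = 1977
protein_length_aa = 658

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
codons = gene_length_bp // 3
residues = codons - 1

print(span == gene_length_bp)         # the coordinates agree with Gene Length
print(gene_length_bp % 3 == 0)        # the CDS is a whole number of codons
print(residues == protein_length_aa)  # the codon count agrees with Protein Length
```

Under these assumptions the span is 1977 bp, which is 659 codons, giving 658 residues plus one stop codon.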


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTAAAC ATCCCACTCG CAGCCTCAGT TTTGGCCTTT TGATCATCGC GCTGGGCTTA 
CGCTTCTTCC GCCTGAGCAG CCAAAGTTTA TGGCTCGATG AAGGCGGTAC CTGGAGCGAG
GCTACTGGCC GTTCGTGGCT CAGTTTATTC GGCGATCTGT TTAGCCCACG CCAAAGCTAC
CCACTGTATC ATTTGCTGAT GAAGGCTTGG ACGAGCGTGT TTGGCGATAG CGAGCTGGCC
TTGCGTGCGC CTTCGGCGCT TGCTGGGACG CTGGCGGTGG TGTTGCTGTA TCATTTGGCA
TGGCGTTTGA GCACACCCAC AGTGGCTTGG GTTGCGGCTG GCTTGTTGGC GATCAATCCG
TTTGGTTTGT GGCAAAGCCA AGATGCCAAA GTCTATAGCA TGTTGATGGC GGCGGTGCTT
TGGTCGAATC GAGTATTGCT CGATCTGCTT GATCAAACAA AACCGAAGCA GGTTTGGCTG
TGGTTTGCCA GTGTGGTGCT CTGCCTAAGT TTGCATCGTT TGGCCTTGTT GCAAGTGCTC
GGTCAAGTGT TGGTATTGGC GTTGCATTTC AGGCAAGCAG CCTGGCGCAA ATGGCTTTGG
CTTGGGTTTG GCTTGCTCAG TCTTGGTTTT GTGGCAGGAA TTATGTTTGG CTTGCGCCAA
GACCCGAATG CACCGCCGAT TGGCCGCACG ATTGGGGCAT TTCAAGCAGC TTGGCTTTTG
CTTGGCCGCC TCAGTTTCGA CCGTTCGAGC AGCGATTTAA CTTGGCGTTT GTTGCCTGTG
GGTTTAGCAC TCGCCAGCGG TTTATGGCTG ACATGGAGCC ATCCACGGCG AAACATTATT
TTAAGCTTGT GGCTCTTGCC CACGCTGCTC TTTTTGGCGG TATTGGCAGC AGGCTCGCCG
CTCTACGAGC CACGCTATCT CAGTTTTAGC TTGCCGCTAT TTTGCTTGAT TTTGGCCTTA
GGCTTGGCTG GTATTGCTCA ACAACGCTGG CTAAGTGCTG GCAGTATCGT GGCGATTACA
GGGTTAAGCG CTTGGTTGGT GTTTCAGCAG CCTTATGGCA TTTGGAGCGG CAACGCGGTC
AAGGAAGATT ATCGTGGGGC CGTCCAAAGT TTGGCTGAGC ATGTTGCCCC TGATGATGTG
ATCATCGTGC AACCTCCCTA TATTGCAACC TTGTATCAAT ATTATGCCAG TCGTGTTACA
CCCGATGCGT TGCCTATTGC GCGTGGGTTT GGGCGCATCG GCGCAATTGG CTATGATCAA
GGCGAGTTTG ATAATGATTA TGCTGCGTTA TTGGCAGGCC AACGCCGTGG CTGGCTGTTG
ATTGCACCCG AAAATGCCAA AGCGATCGAT CCGCCTAACC CGCAATATCC CCAAGATGAT
ATGGGCAAAG TTGGGATTAA TTTTCTGACT GCTGATTTAA ATGATAAATG GCGTTGCACC
GATTTACCCT ATTGGGAATT TAATGCATTG CGGATTTTGT GTCAGAGCTT TCCACGGCCA
ATGCAAGTCG AAAATTTGGC GCTGGTAGGC ACATGGCCCA TTCCCAAATC GGCGACTGCA
ACCTGGGATA ATAGCTTACA ACTTGAGGGC TACCAATTTC AAGCTTGGGC TGATGGCTAT
CAGGCAGGTG GCACATTGCC TGTGCAACTG GCTTGGCGCA CCAACAACGT TTTAGTCAAA
GATTATCGTT TATTTATTCA TTTAGTGCCA GCGTTGGGTG CACCAGTTGC TGCCCAAGTT
GATACGATGC CGCTCAATGG TGGTTTGCCA ACCAGTCGCT GGCAGGCCAA TCAAGCGATT
CATGATCAAG TTTCTGTGCC CTTGCCCAAA AATCTTGCCA AAGGCCGCTA TTTGGTCGTG
CTCGGCTGGT ATGATCCAAC AATTACCGAG GTTGCGGCTC AGCGCTTGCC CGTGACAGCC
AGCACCGCCC AGCATAGGGC CAACTACATC GAATTGGGCA CGGTTGAGAT CGAGTAA
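
The gene sequence above is displayed in blocks of ten bases separated by spaces. A minimal sketch of how such a listing could be cleaned up and its GC fraction computed is shown below. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the sample is only the first 30 bases of the listing, so its GC fraction is not expected to match the 52% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence, copied with the display spacing.
sample = "ATGCCTAAAC ATCCCACTCG CAGCCTCAGT"
dna = clean_sequence(sample)
print(len(dna), round(gc_fraction(dna), 3))
```

Pasting the full listing into `sample` would give the figure to compare against the GC content field.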
 
Protein sequence
MPKHPTRSLS FGLLIIALGL RFFRLSSQSL WLDEGGTWSE ATGRSWLSLF GDLFSPRQSY 
PLYHLLMKAW TSVFGDSELA LRAPSALAGT LAVVLLYHLA WRLSTPTVAW VAAGLLAINP
FGLWQSQDAK VYSMLMAAVL WSNRVLLDLL DQTKPKQVWL WFASVVLCLS LHRLALLQVL
GQVLVLALHF RQAAWRKWLW LGFGLLSLGF VAGIMFGLRQ DPNAPPIGRT IGAFQAAWLL
LGRLSFDRSS SDLTWRLLPV GLALASGLWL TWSHPRRNII LSLWLLPTLL FLAVLAAGSP
LYEPRYLSFS LPLFCLILAL GLAGIAQQRW LSAGSIVAIT GLSAWLVFQQ PYGIWSGNAV
KEDYRGAVQS LAEHVAPDDV IIVQPPYIAT LYQYYASRVT PDALPIARGF GRIGAIGYDQ
GEFDNDYAAL LAGQRRGWLL IAPENAKAID PPNPQYPQDD MGKVGINFLT ADLNDKWRCT
DLPYWEFNAL RILCQSFPRP MQVENLALVG TWPIPKSATA TWDNSLQLEG YQFQAWADGY
QAGGTLPVQL AWRTNNVLVK DYRLFIHLVP ALGAPVAAQV DTMPLNGGLP TSRWQANQAI
HDQVSVPLPK NLAKGRYLVV LGWYDPTITE VAAQRLPVTA STAQHRANYI ELGTVEIE
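
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is an illustration, not the database's own procedure. It uses the standard codon assignments, on the assumption that translation table 11 shares them for elongation and differs only in which start codons are permitted. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna_text: str) -> str:
    """Translate a DNA listing in frame 1, stopping at the first stop codon."""
    seq = "".join(dna_text.split()).upper()  # drop display spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence listed above.
print(translate("ATGCCTAAAC ATCCCACTCG CAGCCTCAGT"))
print(translate("GTTGAGAT CGAGTAA"))
```

The two fragments correspond to the first ten and last four residues of the protein sequence listed above. This sketch always reads an alternative start codon such as GTG or TTG as its ordinary amino acid. That does not matter for this gene, which begins with ATG.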