Gene Haur_2800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2800 
Symbol 
ID5734681 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3559699 
End bp3560976 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content49% 
IMG OID641279943 
Producthypothetical protein 
Protein accessionYP_001545566 
Protein GI159899319 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0330256 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGAGC TTTCGTTGAC TCTTGCTCGT TGGACAACTC GTCGTGTCAT GACCGCCACT 
TTAGTCGTCC TCGCGATCAT CGGCCTCGCT TTTGTGATTG TTAATTTTTA TAGTGTGTTT
GTGGTAGCCT TTATTGCCTT TGTGCTCAGC ACCGCGATTC GCCCATTGGT GCAATTATTA
CAACGGCTCA AAATCTCACC CCAACTAGGG GTAATTATTG CCTACCTGCT ATTATTGGCA
CTGCTGGTGG GCATTGTTGT GCTGATGGCT CCGCTGATCA CCGAACAAAT TACCGCAATT
ACTGCCAAAA TTCCTGAGTA TTATCATGAT GCCCGCCAAT TTTTAATTTC TTCGCGCAGT
AGTTTTATTC GCAATTTAGC GTTACGTTTG CCACTTGATG CCCCAATTTC GCTATCTGGA
GTTGCGCCCA GCGAAACCAG CCAGCAACAA ACTGACGTAG CAGTAAGCCA ACTTGTCACT
GTTGTAGAAA ATGCGGGCAT CAGTTTGTTT GTGTTGATTG CCACCTTACT GCTGGGGTTT
TATTGGACGC TTGATGGTGA TCGGGTACTG AGAACCTTGC TCCAGCTCGT CGCGGCAGAA
AAACGTGAGA ATTGGCGTTC GCTGATTGCC GAAATTCAGG CCAAAATGGG CGCATTTATT
CGCGGACAAC TCATCCTTGA TCTCTCGATT GGGGCACTTT CAACCGCCGC CTACTTGCTA
ATTGGTATCG ATTATGCGAT TGTACTGGGC ATTTTGGCTG GTTTACTCGA AACCATCCCG
ATTTTGGGGC CAGTGCTTGG GGCAGTGCCA CCCTTGCTGA TTACCCTGGC CCAAGGTGAC
ACAACCGCCT TTATTTGGGT GATTGTAGCG ACCGTGGTGA TTCAACAAAT TGAAGGCACT
TTTTTAGTGC CCAAAGTGAT GGATCGGGCG GTTGGGGTGA ATGCGGTGCT GACCTTGGTC
GCTTTTGCGG CCTTTAGTGC GACCTTAGGC TTGGCTGGGG GGATTTTGGC CGTGCCATTG
GCAGCCATTG TGCAAATTAT CTTCACTCGC TTGGTGTTTA ATCAAGCTGA AACCAGCACC
AACGTCACTC GCCGTGATCG CTTTGGCGTG CTGCATTATG AATCACAGCA ATTGCTCCAA
TCCTTGCAAC GCCATAATCG GGCTGATGAA ACTGAAGATG AAGCTAATAT TGTGTTTGAC
GATCAGCTCG AACATGTGGT CGTCGATTTG GATGGCATGT TGGCGGCGAT CAATAGCAGC
GAGGAAGTAG CGGCATGA
 
Protein sequence
MDELSLTLAR WTTRRVMTAT LVVLAIIGLA FVIVNFYSVF VVAFIAFVLS TAIRPLVQLL 
QRLKISPQLG VIIAYLLLLA LLVGIVVLMA PLITEQITAI TAKIPEYYHD ARQFLISSRS
SFIRNLALRL PLDAPISLSG VAPSETSQQQ TDVAVSQLVT VVENAGISLF VLIATLLLGF
YWTLDGDRVL RTLLQLVAAE KRENWRSLIA EIQAKMGAFI RGQLILDLSI GALSTAAYLL
IGIDYAIVLG ILAGLLETIP ILGPVLGAVP PLLITLAQGD TTAFIWVIVA TVVIQQIEGT
FLVPKVMDRA VGVNAVLTLV AFAAFSATLG LAGGILAVPL AAIVQIIFTR LVFNQAETST
NVTRRDRFGV LHYESQQLLQ SLQRHNRADE TEDEANIVFD DQLEHVVVDL DGMLAAINSS
EEVAA