Gene Haur_3059 details

Gene Information

Locus tag: Haur_3059
Symbol:
ID: 5734931
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3865591
End bp: 3866880
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 53%
IMG OID: 641280203
Product: hypothetical protein
Protein accession: YP_001545825
Protein GI: 159899578
COG category: [S] Function unknown
COG ID: [COG4102] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
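
The start, end, gene length, and protein length values in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3865591
end_bp = 3866880
gene_length_bp = 1290
protein_length_aa = 429

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons.
# Assumption: the last codon is a stop codon and is not counted as a residue.
codons, remainder = divmod(span_bp, 3)
coding_codons = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions the three checks agree with the listed values: 1290 bp corresponds to 430 codons, one of which is the stop codon.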


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTTAA CCCGTCGTCA ATTTGTAGTT GGCTGTAGCA GCGCGATTGC AGCTATGGCT 
GGTGGTCGGC TGGGTGGTTT GGCTTTTGCC GAGCCAGGTG ATATTAACCG TGATATTTTT
GTGGTGGTGT TTCTGCGTGG TGGTTGCGAT GGCATCGGTA TCGTTTCGCC GCTTGATGAT
GCCAATTTTC AAGCCGCCCG TAGCACAATC ACCTTTCCAA GCAGTGGCAC AGGCGCGGGC
TTTGAATTAG GTTCATTGAG CAATGTGCCG TTTTGGTTGC ACCCCAAAGC GGCTGCCTTC
AAAGAACTCT ACGATAGCCA AGATTTGGCC TTTATTCACG CTAGCGGCTT GACCAACGGC
ACCCGCAGCC ACTTCGATGC CATGGATTTT ATGGAACGTG GCACGCCCGA CAATAAATCA
ACTAGCACAG GTTGGCTGAC CCGCCACATG GCTGCCACTC GTCCCGATGG GGTTGTGCCA
GTTATGTCAA CAGGATCAGC TTTACCTGCT TCGTTGCTTG GCAGCCCGAA TGCCGTCACG
ATTTCGAACG TGCAGCGTTA CGCTATGCAA GGCTACTCGA CCTATGGGGC GCAACAACAA
GCCTCATTAA ACGAAATTTA TAGCCAAACT GGCAGCTTGC TTGATGGCCC AGCCACCCGT
TTGCTTAGCT CAATCGCCGC AGTCAAGGCA CGCAACCCCG CCAATCCCTA CGTGCCAATT
ACCACCTATC CTGCTGGGGG CTTATCGGAT TCGCTCAAAG CCATCGCCCA GATGATCAAA
CTGGATGTTG GTTTGCAAGT TGCGACGCTT GATTTTGGTG GCTGGGATAC TCATGAATCG
CAGGTGCCAA TTTTGGGCAA CCAACTTGAT TTATTGACGC GTTCGCTGCA TGCCTTCTAC
AACGACTTGG TTGATTACCA CAGCAAGTTG ACGATTGTGG TGATGAGCGA ATTTGGCCGT
CGCTTGAAGG CCAATCGTAG TGCTGGCACC GACCATGGCC ATGGCAATTT GGCGATGGTT
TTGGGCGGCA ACGTCAATGG TGGGCGAATT TTCGGGCGCT GGCCAGGCCT CGCCAATGCC
CAACTCGACC ATGGCGTTGA TTTGGCGATT ACCACCGACT ATCGCACGAT TTTGAGCGAA
ATTGTGGTGC GCCGCTTGCG CAACAATCGT TTAGGCTTGG TTTTCCCACA AATTAGCCAA
TATCAACCGC TTGGCTTAGT ACGGGGCACC GATCTAACAA TTGATTGGAC TTCAGGCTTC
CGCTCATATT TACCAATGGC CCGCCGCTAG
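
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any length or GC calculation. The sketch below shows one way to do that; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as input, so the resulting percentage describes that excerpt and is not expected to equal the whole-gene GC content listed in the record.

```python
def clean_sequence(text):
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used here only as an excerpt.
excerpt = "ATGGATTTAA CCCGTCGTCA ATTTGTAGTT GGCTGTAGCA GCGCGATTGC AGCTATGGCT"

seq = clean_sequence(excerpt)
print("bases in excerpt:", len(seq))
print("GC percent of excerpt:", round(100 * gc_fraction(seq), 1))
```

Passing the full sequence text to the same two functions would give the length and GC content of the whole gene.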
 
Protein sequence
MDLTRRQFVV GCSSAIAAMA GGRLGGLAFA EPGDINRDIF VVVFLRGGCD GIGIVSPLDD 
ANFQAARSTI TFPSSGTGAG FELGSLSNVP FWLHPKAAAF KELYDSQDLA FIHASGLTNG
TRSHFDAMDF MERGTPDNKS TSTGWLTRHM AATRPDGVVP VMSTGSALPA SLLGSPNAVT
ISNVQRYAMQ GYSTYGAQQQ ASLNEIYSQT GSLLDGPATR LLSSIAAVKA RNPANPYVPI
TTYPAGGLSD SLKAIAQMIK LDVGLQVATL DFGGWDTHES QVPILGNQLD LLTRSLHAFY
NDLVDYHSKL TIVVMSEFGR RLKANRSAGT DHGHGNLAMV LGGNVNGGRI FGRWPGLANA
QLDHGVDLAI TTDYRTILSE IVVRRLRNNR LGLVFPQISQ YQPLGLVRGT DLTIDWTSGF
RSYLPMARR
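
The protein sequence can be related back to the gene sequence by translating codons. The following is a rough sketch rather than the database's own method: it builds the codon table from the standard amino acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The function name `translate` and the use of a short excerpt are illustrative choices.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Amino acid assignments are those of the standard code, shared by table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and the first 20 residues of the protein.
gene_excerpt = "ATGGATTTAA CCCGTCGTCA ATTTGTAGTT GGCTGTAGCA GCGCGATTGC AGCTATGGCT"
protein_excerpt = "MDLTRRQFVV GCSSAIAAMA".replace(" ", "")

print(translate(gene_excerpt) == protein_excerpt)
```

The same function can be applied to the full gene sequence to compare against the full protein sequence.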