Gene Haur_5068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5068 
Symbol 
ID5737026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp82056 
End bp83732 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content62% 
IMG OID641282233 
Producthypothetical protein 
Protein accessionYP_001547824 
Protein GI159901578 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATCAA CCCACCCGAG CGGTGGGGCG GAGGTCATGC GCAATCCCGT ATGGCTAGCC 
GCACCCGACC GATCATATCT CTGGTTCCCG ACCAAACTTG CGGAAACCCT CGCCCATGCC
CCGCTCGCGT TAGGCATCTA TGCCCTGATT GCGCGGCGCT GGCTCGCCCA GCATACGTCC
GTTCCACTGT CTGCTCACGA TATTCAGGTC TATGACCCGT CACTGACCCG CGCCAAGATT
CGCACGGCCT TCGACCAGCT TCTGGCTGGC CACTGGCTCG CCATTCCTAC GCCTCCGCAG
CACGGCTGCA AAATCGCCTA TGTGCCCACG TGGGGATGGC AACGCGAAGG CGTGCGCCAG
TGGGAACCCG CGCAATCATT CAATCGCGGG CGGGTGGCCA CCCATCGGCT CGACCGCACC
CTGCTCGACT ACTACCTTGG CCGGATTGAG CCACGGTCGC ACGGCCAGCC ACTCATTACG
CGGTATCTGA CCACGCCCGC CCTGACCTTG ACCGATGTGG GCATCTATAT GCTGCTCGTG
GCCGGGATTC CGCATCGCCA TAGCACAGCA ACACTCCAAC ACCGTGGTTT ATGTCAACAG
AATGTCCCGC TGGCCGTGCC AAGCATTGCG GAGATTCTGG CGCAGGGGAC GATGAGTCTC
CACGGCGCAC AGCGTTTGCA GCTGCTGCCA CGCGGCGGAG TCATGCGGCC CAGCGACCCT
GATCCGCGAC CCCCGCTTTT TTTTGTGGAA CCGGACCTGG CGACGATCAT GGTGATGACC
ATGGCGACGA ACATGGCGAT GGCGGACAGC GTGTCCGAGG ACGGTTTTGA CGCGTCAGGA
AGCAAAAAAA CGGCGGTTGC CCATGATTCA TGGAACGTCA CAGGCAGATT AAGCAAAATA
GACAGAGAAG ATCAACCACC AGAAAACACT CTGCATAGCA TACAAAACGG TGGTGGAGTC
TTTATTTCTG ATCGAACCAA CGAACGATCA GGGAACGAAA TCGACCAAGA TCTACCAGAA
CCCGTTACCA GAACTATTCT GCGCCGTCGC AACATTATGT CAATTCCGGG GACCCCGCAG
GTTGCATTGC TGCGCTCACT GGCCATTCGG CCCAAACAAC TTGCCGAGTT TGCCGATATC
GATTTGGCCA CGCTGGAAAC GGTCGTGGCC GATGCCCGCC AGCGCACGGG CGTTCGCGAT
ATCGGCGGGT GGGTCGTCAG TATCCTGCGG GATATTCAGG ATCACGGCTG GGAACCCGCT
GCCGCCAAAT GGCAGATCGA TCAGCCCCGC GACTTCGAGG CCGCGCGCTC CCGGTGGCAC
ACGGCCTTGG GCCTTGATGC ACCGCCCGAA GCGGAGTCCG AAGCCGCTGA CGACGCCTGC
CGCGAATCCG CCCCGCCCCC CGTCGTGGTT GACTGGGTGG CGCTTGAACC GTTGCTTGAC
ACCACGGATA CGGCTGTGTC GCTCGCCGCT CAGGAGCCGC CGCACCGTCC GTTGTGGATA
CCCGCAGCCC TCTGGTTGCG GCTCCGTGCC TCGGTGCGGA TGCTGCTGAT CGCCAGTCGC
TGCGATAATG GACGTATTAC GGCAGGGGAT GCCTGGCGAC AGGCGCGACT CGCGCTCCCT
GCCTATCGCA CGGTGCTTCC GGCCTTCATA CACGCCTGCG AAGCCCTGCG GGAGTAA
 
Protein sequence
MPSTHPSGGA EVMRNPVWLA APDRSYLWFP TKLAETLAHA PLALGIYALI ARRWLAQHTS 
VPLSAHDIQV YDPSLTRAKI RTAFDQLLAG HWLAIPTPPQ HGCKIAYVPT WGWQREGVRQ
WEPAQSFNRG RVATHRLDRT LLDYYLGRIE PRSHGQPLIT RYLTTPALTL TDVGIYMLLV
AGIPHRHSTA TLQHRGLCQQ NVPLAVPSIA EILAQGTMSL HGAQRLQLLP RGGVMRPSDP
DPRPPLFFVE PDLATIMVMT MATNMAMADS VSEDGFDASG SKKTAVAHDS WNVTGRLSKI
DREDQPPENT LHSIQNGGGV FISDRTNERS GNEIDQDLPE PVTRTILRRR NIMSIPGTPQ
VALLRSLAIR PKQLAEFADI DLATLETVVA DARQRTGVRD IGGWVVSILR DIQDHGWEPA
AAKWQIDQPR DFEAARSRWH TALGLDAPPE AESEAADDAC RESAPPPVVV DWVALEPLLD
TTDTAVSLAA QEPPHRPLWI PAALWLRLRA SVRMLLIASR CDNGRITAGD AWRQARLALP
AYRTVLPAFI HACEALRE