Gene Haur_1048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1048 
Symbol 
ID5732952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1196371 
End bp1197825 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content49% 
IMG OID641278183 
Producthypothetical protein 
Protein accessionYP_001543824 
Protein GI159897577 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00243617 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGATGGT TTAGAGTGTT GTTTGTGCTA TTGATTGTAG CAGGATTTGG CCTGACATGG 
TATCTTTTGC AGCCTCAAAA TAGCTTGGTT TGGCAAACAT TCACCCCTAC GGTTATCCCG
CTGGAACAGG CTGAGTTGGC CAATCCTGGG CGTGGGCTAT ATCAATGGCG TGGCCAAACC
ATGATCTGTC CTAGCGAGTT GATTTCGAAT CGCGAACGCT ATGATCGCTG GACATGGGCC
GAACTTGAGC CAAACGAAAA TCAATACGAT TGGCAAGAAA TTCATCAATT ACTGGATATG
GCTGAGCAGA ACGGCCAGCG GGTTTGGCTC GGTTTGGGGG CAAGTGCTGG CCCAAGTAAT
AATGGCCCGT TTTTACCAAT TTACTTGCAG AAACCTGAAT TTGGCGCAAA CTTTGAAGGC
GATTGGTATC CAAATTACAA TCATCCTTTT GTGCAAAACC GCCTCGAAGC CTTGTTGGCA
GCCTTTGTGG CCGAATTTGC GGGCGATCAG CGGATTTTGG GCGTGCAGAT GCGCAGTTAT
GGCCGTTATG GCGAGGGCTA TTTGCCATGG AACGCCGATA AAAGCCATGG AATGTGGGCC
AGCGAAAGTA CGGCCCGCTG GCTGGTTGAT GCTTGGCATA CCCGACTCAG CCCGCATTTT
CTGATTTCAA TTCCGCTGAG TAATAATCCA GTGTTTTACT ATGCCATGAC CAAGCAACCC
TATTGGAGCA TTACCCGTGA TGCCTTGGGC ATGCCTGAAC AAATGGGTAA TATTGATCAA
CTCATTCAAA GTGATATTAC GGTTGATGGC CAAGCAATTG GCCCGTTGGT GGCTGAGCGT
TGGAAAGTAG CGCCGATGTT TAGCGAGATG ATCGGTGAAT ATGGTGAGCG CGATTATAGT
GGCCAGTTTT TGGCAGCCCA AACCCAAGTG ATTTCATATC ATATTTCGTA TGTGAGCAAC
GGCAATTTTG CCCAGCCCTA TCGCGCAAGC CCGTGGGATT TCTGGCGTGA TCCAATTAAT
TGTCCTGAGC AAGCCAGCAA TTGGAGTAAT GCCGATATTG AGAATTTTAT GCTGGCGGGC
AAATTAGCCG GTTATCGCTA TGCCCCAACC ACGATCAAAC TCGCCATCGA TAACCAGCAG
CTCCAGATCG AAAGTAGCTG GCAAAACGCT GGAGTTGCCC CGATATATGA GCGTTGGCCG
TTGGTATGGC AATTACGCGA TGCTACTCAG ACTGTGGTTT GGCAAGGCGA ATCAAGCCTT
GATTTACGCC AACTTTTGCC AGCCCAAGCC TATGAGCATC GCCAACAATT TGAACAATTC
AAGCTGCCAG CGGGTGAGTA TGAATTGCGA TTGGTTGCTC CAGCGATCAA TCGCTATGTA
CGGCCTTTGC AATTGGCAAT TGAAGGCCAG CTGGATGATG GGGCATATCG GATTGGAATT
CTGAGCATTC GTTAA
 
Protein sequence
MRWFRVLFVL LIVAGFGLTW YLLQPQNSLV WQTFTPTVIP LEQAELANPG RGLYQWRGQT 
MICPSELISN RERYDRWTWA ELEPNENQYD WQEIHQLLDM AEQNGQRVWL GLGASAGPSN
NGPFLPIYLQ KPEFGANFEG DWYPNYNHPF VQNRLEALLA AFVAEFAGDQ RILGVQMRSY
GRYGEGYLPW NADKSHGMWA SESTARWLVD AWHTRLSPHF LISIPLSNNP VFYYAMTKQP
YWSITRDALG MPEQMGNIDQ LIQSDITVDG QAIGPLVAER WKVAPMFSEM IGEYGERDYS
GQFLAAQTQV ISYHISYVSN GNFAQPYRAS PWDFWRDPIN CPEQASNWSN ADIENFMLAG
KLAGYRYAPT TIKLAIDNQQ LQIESSWQNA GVAPIYERWP LVWQLRDATQ TVVWQGESSL
DLRQLLPAQA YEHRQQFEQF KLPAGEYELR LVAPAINRYV RPLQLAIEGQ LDDGAYRIGI
LSIR