Gene Haur_4295 details

Gene Information

Locus tag: Haur_4295
Symbol:
ID: 5736154
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5482942
End bp: 5484849
Gene Length: 1908 bp
Protein Length: 635 aa
Translation table: 11
GC content: 47%
IMG OID: 641281455
Product: hypothetical protein
Protein accession: YP_001547055
Protein GI: 159900808
COG category:
COG ID:
TIGRFAM ID:
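
The start, end, and length fields above are consistent with each other if the coordinates are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein length. Both readings are assumptions, not something the record states. A minimal sketch of that arithmetic follows; the function name cds_lengths is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a single terminal
    stop codon that is excluded from the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Coordinates taken from the record above.
print(cds_lengths(5482942, 5484849))  # (1908, 635)
```

Under these assumptions the result matches the listed gene length of 1908 bp and protein length of 635 aa.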


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACGTT GGATTGTGTT GGCAATTTTC GTCGTCGGTT TAGGGCTTCC CCAAGCCACA 
CAAGCTACCG AATTGCAATG TTTTGAGCAA ACAGGCTTTT GTACCGATGG TCGTTTTTTA
GAGTATTGGC GGCAAAATGG TGGCTTAGAA GTCTTCGGTT ATCCGTTGAG CACAATTGAC
ATCGTTTACA ATCAGGATAG CCAGATGCAT TTCCTGACTC AGCAATTTGA GCGTGCTCGC
TTTGAGTTTC ACCCTGAATT TGCCGCGCCC TACGATATGT TGCTTGGGCG GTTGGGCGAT
GATCTGTTGC GCTATCGTAA TATCGATAGT GCCATGTTGC CACGTGAGGC TGGCGCAACA
TCAGGCTGTT TGTGGTTTGA AACAACTGGA CACAACGTCT GTAACCAAGC CAATGGCCTC
GGTTTTATGA GCTATTGGCA AAACCATGGC CTCAACGATC CCAAACTTGA TGCCTTTGGC
CGTTCATTGC AATTATTTGG CTACCCGCTG ACTGAGCCAG CCATAGAAAC CAATGCGAAT
GGCGATAGTG TACTCACCCA ACATTTCGAA CGCGCCCGTT TCGAGTGGCA TCCCAACCAA
CCTGATCAAT TCAAAGTGCT GCTTGGCTTA GTCGGCAAAG AATCGCAAAA ATTAGTCTAT
GGCGCAAGTG CCGATCCATC AAAATTGACC TTGGTTGGCG ATACGCTCTT TTTCACCGCC
GACGATGGCG TGCATGGCCG TGAATTATGG ACAAGCGACG GCACCGAGGT TGGCACACGC
TTGGTCAAAG ATCTGAGCGT TGGCACTGAG TGGGGTGGAA TTTATGAATT AGCCGCTGTC
AATCAGGGCG TTATTTTTGC CGTCAACAAG CAGGATAAGG ACTATCAATT ATGGTATAGC
GATGGCATTG AAGCTGGCAC ACGCTTAATC AAAAGCTTTG TTCCAAACGC TAAGATCAAC
AATCTCAAAT CATTAGGCAA CGGGATCATC TTTTGGGCAA ATGATGCAGT TCATGGGATT
GAACCATGGT ATAGCAACGG CACTGAAGCT GGTACATATT TGCTGAGTGA TATCAACCCA
GGCCTTGCAG ATTCAGCAGT TTCCAACAAT TATGGATCTT ATTGGGTTGA CTATGCCCCC
ATCGCGGGAG GTATGGCCTT TTTTGCCCAA AATAACCAAA TTGGCAATCA AATCTGGTGG
ACAGATGGCA GCATTGCCAA CACTCGCCAA ATCAGCAATT TAGCAATCTC ATTTGGTATG
CTTGAGCTAG AAGTGTTAGA TCAGCAACAT TTGATCGCAA CAGCCTATCA AAATACAACG
ATGGGGGTTT GGAATATAAC GCTTGCTACT GGCGAACAGC AACTATTAGC CAGCTATCCT
GCAATTGCCA CAACCCGTAA TCCAGCATCA GCTATACAAC TAACCCAAGC TGGTGGAAAG
GTCTATTACC TTAGCAAAAC CCAAGCAGGC GAGCTTAGCC TATGGCAAAC TAATGGCCAA
GCCGATCAAA CCATCCAACC CAATCTGCAA GGCTACAACG CCGAACATAT CGTAGCTGCA
AACGATCAAT TGTATATGCG ACTGACTAAT CCTCAAGGCA TTCAGGCTGG CTGGTGGTAT
TTCGATTCAA GCCAAGGGTT GACTCAACTA ACGCCATTAC CACTGCATAT CCATGCCGCG
AACAATCGCC TATTGGGATG GGAATCGATC GCTGGAGGAT TACGGTTCTA TAGCACTAAT
GGGCCAAATC AAGCCCTACG CTATCGTAGC TCAGTTATGG GCAAGCAGAC ATACTTCCCT
GATACGACCA ATGATCGATT TTTCGCCATC CCTAGTTTTC AGTATGGCAC GGAGCTATGG
TCTAACGATG GCAGCACCCT GCGGATGGTC AAAGATATTC AGCCATAA
 
Protein sequence
MKRWIVLAIF VVGLGLPQAT QATELQCFEQ TGFCTDGRFL EYWRQNGGLE VFGYPLSTID 
IVYNQDSQMH FLTQQFERAR FEFHPEFAAP YDMLLGRLGD DLLRYRNIDS AMLPREAGAT
SGCLWFETTG HNVCNQANGL GFMSYWQNHG LNDPKLDAFG RSLQLFGYPL TEPAIETNAN
GDSVLTQHFE RARFEWHPNQ PDQFKVLLGL VGKESQKLVY GASADPSKLT LVGDTLFFTA
DDGVHGRELW TSDGTEVGTR LVKDLSVGTE WGGIYELAAV NQGVIFAVNK QDKDYQLWYS
DGIEAGTRLI KSFVPNAKIN NLKSLGNGII FWANDAVHGI EPWYSNGTEA GTYLLSDINP
GLADSAVSNN YGSYWVDYAP IAGGMAFFAQ NNQIGNQIWW TDGSIANTRQ ISNLAISFGM
LELEVLDQQH LIATAYQNTT MGVWNITLAT GEQQLLASYP AIATTRNPAS AIQLTQAGGK
VYYLSKTQAG ELSLWQTNGQ ADQTIQPNLQ GYNAEHIVAA NDQLYMRLTN PQGIQAGWWY
FDSSQGLTQL TPLPLHIHAA NNRLLGWESI AGGLRFYSTN GPNQALRYRS SVMGKQTYFP
DTTNDRFFAI PSFQYGTELW SNDGSTLRMV KDIQP
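
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below is an illustration under stated assumptions, not the method used to produce this record: it builds the standard codon-to-amino-acid mapping, which translation table 11 shares for internal codons, and translates the first and last blocks of the gene sequence. The names CODON_TABLE and translate are hypothetical. Table 11 also allows alternative start codons that are read as M at the initiation position; that case is not handled here and does not arise for this gene, which starts with ATG.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGAAACGTT GGATTGTGTT GGCAATTTTC"))  # MKRWIVLAIF
print(translate("AAAGATATTC AGCCATAA"))               # KDIQP
```

The two outputs agree with the first ten and last five residues of the protein sequence listed above.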