Gene Haur_3768 details


Gene Information

Locus tag: Haur_3768
Symbol:
ID: 5735632
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4736125
End bp: 4737702
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 52%
IMG OID: 641280920
Product: ankyrin
Protein accession: YP_001546532
Protein GI: 159900285
COG category: [R] General function prediction only
COG ID: [COG0666] FOG: Ankyrin repeat
TIGRFAM ID:
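
The length fields above agree with the coordinates if the Start bp and End bp values are read as 1-based, inclusive positions. The record does not state that convention, so it is an assumption here, as is the assumption that the final codon is a stop codon that is not counted in the protein length. A minimal sketch of the arithmetic, with illustrative function names:

```python
# Coordinates copied from the record; 1-based inclusive positions are ASSUMED.
START_BP = 4736125
END_BP = 4737702


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, ASSUMING the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, "bp;", prot_len, "aa")
```

Under these assumptions the results match the Gene Length (1578 bp) and Protein Length (525 aa) fields.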


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAGTA TAACGATCGA TGAGCGGCTT TTTGAGGCAC TTCGTTCTGG GCAAAGCTTA 
ATTGCTACTG AGATTCTTGC TGCACACCCT GAGCTAGCAA CCACCCGCTT CGATCCCATC
GATCAATCGC TGAGTGTTAC GTCTTTCGTA TGCCATCCAG AGCAATACGG CAATGCTCGG
ATCGGTCAAT CCCCACTCCA TTTGGCAGCA TGGAACGGTG AACAGCGCTT GGTTAAGCAG
CTACTTGAAC TTGGTGCTGA CCCCAATGCC CGCGATCGGC AGGGTGGCAC GCCGCTGCAT
GCGATGGTAC GCTGGGTTAC CCGACCTGAT ATTGTCGGCA TGGTATTGGA ACGAGGCGCA
GATATTAATG CCGTTGATTA TGCTGGGCAA ACACCATTAC ATTTAGCGGC TAGTTGCATT
CGTCGCCCGG GTCATCAATG GGGCAATCAC ACCGACCTGT GCAACTTTTT GTTAGCACAT
GGTGCAATTG CCGATATTTT CGCGGCAGTC ATGCTTAATT TAACCGATCA GGCGGCGATG
CTGCTCAAGC AAAATCCTGA ACTAGTGCAC GCCCGCACAA CTGGCAATCA GACCCATCCA
GAAAGCGCGA CACCATTGCA TATTGCCGTA GATCGTGGCA AGCAGGCCAT GGCGGAAATG
CTGCTGGACT ATGGTGCTGA TCCCAATAGC CTCGATGCCC GTGGTCGCCC AGCCTTGTAT
CTGGCAGCGC ATATAGCCGG AACGCGCAAA CTAGAGCCAA CCCCTGAACT GGTAGATCTG
TTGTTACAAC ATAGTACAGC TACACCGATC TTCAATGCCA GCCTGATCGG CCAGTGTGCT
GAACTTCGTG AGTTGCTTAT CCACGATCCT GCACAAATTC AGGCGCTTGA TCAAGCTGGA
TATACTGCCC TGCATTTGGC GGCATGGAAT GGTCAAGTTG CAGCGGTTGC CGAATTATTG
GCGCATGATG CCGATATTGC TGCCCGAACC AAACGCAACG AAACCGCCCT GCAACTCGCA
ATAACCTATG GTCACCATGC AACTGCCGAA CTGCTGCTGA ACCATGGCGC AACTCCCGAT
ATATTTAGTG CTGTTATCCT TGGTCGGATT GATCTGCTGG AACAATTGCT GGATCATCAA
TCCGAACTCG CCAGTACCAC CAATCGCTAT GGACGCACGC CGTTGCGGCT GGCAATTGAA
CGTGAGCAAA CAGCAGTTAT CGATTATTTG ATTGGTCGAG AAGTTAAACC CGACCTATGG
ATGGCGGCAG GTATGGGCGA TTTTGCCAGG GTCGAAGCCT TAGTCGAAAC TGATCGTCAC
GCTTTACATC AGCGCGATCA ATGGGGCTAT ACTGCGTTAC ATTGGGCCAG TAAATCTGGG
CAACTTGCGG TGATCGAATA TCTGCTTGAG CAGGGTGCTG GCTTGGAGCC GCGCGGCTCT
GATGGTGGCA CGCCGCTTAC CTTGGCCTTG TGGCATGAAC AATCGGCAGC AGCCCGCCTG
TTGGTTGCTA GCGGCGCTGA TATTGATGCT CTAGACAATT GGGGTGGTTC ACCACGTAAT
CAAGTAGCAA CGCTCTAG
 
Protein sequence
MNSITIDERL FEALRSGQSL IATEILAAHP ELATTRFDPI DQSLSVTSFV CHPEQYGNAR 
IGQSPLHLAA WNGEQRLVKQ LLELGADPNA RDRQGGTPLH AMVRWVTRPD IVGMVLERGA
DINAVDYAGQ TPLHLAASCI RRPGHQWGNH TDLCNFLLAH GAIADIFAAV MLNLTDQAAM
LLKQNPELVH ARTTGNQTHP ESATPLHIAV DRGKQAMAEM LLDYGADPNS LDARGRPALY
LAAHIAGTRK LEPTPELVDL LLQHSTATPI FNASLIGQCA ELRELLIHDP AQIQALDQAG
YTALHLAAWN GQVAAVAELL AHDADIAART KRNETALQLA ITYGHHATAE LLLNHGATPD
IFSAVILGRI DLLEQLLDHQ SELASTTNRY GRTPLRLAIE REQTAVIDYL IGREVKPDLW
MAAGMGDFAR VEALVETDRH ALHQRDQWGY TALHWASKSG QLAVIEYLLE QGAGLEPRGS
DGGTPLTLAL WHEQSAAARL LVASGADIDA LDNWGGSPRN QVATL
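
The relationship between the two sequences above can be checked by translating the gene sequence codon by codon. The sketch below is one way to do this and is not part of the record: it strips the 10-base grouping spaces, then translates using the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which start codons it permits). The helper names `clean`, `translate`, and `gc_percent` are illustrative, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Codon assignments in TCAG order (standard code; table 11 uses the same
# assignments for elongation).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove grouping spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record.
FIRST_LINE = "ATGAATAGTA TAACGATCGA TGAGCGGCTT TTTGAGGCAC TTCGTTCTGG GCAAAGCTTA"

peptide = translate(FIRST_LINE)
print(peptide)  # MNSITIDERLFEALRSGQSL
```

The printed peptide matches the first two 10-residue groups of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the listed GC content.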