Gene Haur_3664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3664 
Symbol 
ID5735525 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4606508 
End bp4608013 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content51% 
IMG OID641280813 
ProductL-arabinose isomerase 
Protein accessionYP_001546428 
Protein GI159900181 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2160] L-arabinose isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGACT TGAAACAATA CGAAGTTTGG TTTATTACGG GCAGCCAACA TTTGTATGGC 
CCCGAAACTT TAGAACAAGT TGCCAAACAC TCCCAAATTA TCGCCGCTGG GCTTGATCAA
AGTAGCACAA TTCCGGTGCG GGTGGTGTTT AAGCCTGTGC TTAAAACGCC CGACGAAATT
TATAATTTGG TGCTCGAAGC CAACGCCGCC AAAAACTGCA TTGGCTTGGT CGCGTGGATG
CATACCTTCT CGCCAGCCAA AATGTGGATT GCCGGCCTAC GCAGCTTGCA AAAACCATTG
GCTCACTTGC ACACCCAATT CAACAGCGAA ATTCCATGGT CGGAAATCGA CATGGATTTT
ATGAATCTCA ACCAAGCCGC CCACGGCGAC CGCGAATTTG GCTTTATTGT CAGCCGCATG
CGCTTGGAAC GCAAAGTGAT TGTGGGTCAT TGGCAAGATA GCGAAGTGCA TGATCGAATT
GCTGCCTGGA CTCGCGCCGC CGCTGCTTGG CACGATGCTC AAGGCGCACG CTTCGCCCGC
TTTGGCGACA ACATGCGTGA AGTTGCCGTA ACCGAAGGCG ATAAAGTTAA TGCTCAAATG
CGGCTTGGCT ATAGCGTCAG CGGCTATGGC GTGGGCGATC TAGTGCGCTT TGTCAACGAA
GTTAGCGATG CCGATATCGA CAGCACGTTG CAAGAATATG CCGAACAGTA CGAATTGGCA
GCAGGTTTAC AAGCTGGCGG CGAGCAGCAT CACTCGCTAC GTGAAGCCGC CCGCATCGAG
CTAGGCTTGC GCTATTTCCT TGAGCATGGC AATTTCAAAG GCTTTACAAC GACGTTTGAA
GATTTACATG GCTTAGTGCA ATTGCCAGGT TTAGGCCCAC AACGCCTGAT GGATCGTGGC
TATGGCTTTG CTGGCGAGGG CGATTGGAAA ACCGCTGCGC TGGTGCGGGC AATGAAGGTG
ATGAGCGCTG GCCTCAACGG TGGCACTTCT TTCATGGAAG ATTACACCTA TCACTTTGGC
AGCAACGGCA TGAAAGTGCT AGGCGCACAT ATGCTCGAAA TCTGTCCATC AATTGCTGCG
ACTAAACCAC GGCTCGAAGT GCATCCCTTG GGCATTGGTG GCAAGGCTGA TCCGGTGCGT
ATGGTATTTG ATGCCAAAAC TGGACCTGCC GTCAACGCCT CAATCGTGGA AATGGGCAAT
CGTTTGCGCT TGATCAATAG CGTGGTTGAT GCAGTTGAAA CCGACCAACC CTTGCCCAAA
TTACCAGTTG CCCGCGCCTT GTGGCTACCC CAGCCCGATC TCAAAACCGC TGCGGCAGCT
TGGATCTACG CAGGCGGAGC GCATCACACG GGCTTTAGTT TTGATCTGAC CAGCGAACAT
CTCGCCGACT TTGCTGAAAT TGCTGGCATG GAATATTTGC AAATTGATCG CAACACCAAC
GTTCAGCAAT TCAAACAAGA ACTTCGTTGG AACGATCTGT ACTATCACTT GGCCAAAGGT
TTGTAA
 
Protein sequence
MLDLKQYEVW FITGSQHLYG PETLEQVAKH SQIIAAGLDQ SSTIPVRVVF KPVLKTPDEI 
YNLVLEANAA KNCIGLVAWM HTFSPAKMWI AGLRSLQKPL AHLHTQFNSE IPWSEIDMDF
MNLNQAAHGD REFGFIVSRM RLERKVIVGH WQDSEVHDRI AAWTRAAAAW HDAQGARFAR
FGDNMREVAV TEGDKVNAQM RLGYSVSGYG VGDLVRFVNE VSDADIDSTL QEYAEQYELA
AGLQAGGEQH HSLREAARIE LGLRYFLEHG NFKGFTTTFE DLHGLVQLPG LGPQRLMDRG
YGFAGEGDWK TAALVRAMKV MSAGLNGGTS FMEDYTYHFG SNGMKVLGAH MLEICPSIAA
TKPRLEVHPL GIGGKADPVR MVFDAKTGPA VNASIVEMGN RLRLINSVVD AVETDQPLPK
LPVARALWLP QPDLKTAAAA WIYAGGAHHT GFSFDLTSEH LADFAEIAGM EYLQIDRNTN
VQQFKQELRW NDLYYHLAKG L