Gene Haur_1970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1970 
Symbol 
ID5733859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2415151 
End bp2417355 
Gene Length2205 bp 
Protein Length734 aa 
Translation table11 
GC content51% 
IMG OID641279114 
Producthypothetical protein 
Protein accessionYP_001544741 
Protein GI159898494 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCTTG GTCGTTCATG GATGATCATC GTTGCAGTTA TGATCATTGC GCTGCTCAGT 
GTGATCACTG CAACCGCCAT GGTCAGCCCC AGTGTCAGCT CATCGGATGG GCTTTGGCAG
AATGTTGCTG AACAGGACAT TCAACAAAAA GGCTCACGCG AAATCATTCC AGTAGTGTAT
CGCACGGTCG CCTTGGATCT TAATTTGCTA CAACAACACT TACGTCAAGT GCCTCAAGAA
GCCCAAACCA AGGTGCAAAA ATCTGGCTTT ATGTTAGATT TACCGCTACC AGATGGCCAA
TTTGGCAAAT TTCGCGTCGT CGAATCGCCA ATTATGGCTC CCGAATTAGC CGCCAAGTTT
CCTGAAATTC GCACCTTCTT GGCCCAATCA GTTGATCAGC CTGCAACCAG CGCTCGGCTT
GATATCACGC CACGCGGCTT TCATGGCATG ATCTTGAGCG AATCAGGTCG GATTTTTATC
GATCCATATA GCCGCAATGA TACGGCTAAC TATATTGTGT ACGATGCTCG CAATTTTGTG
GCCGACCCTA GCAAATTGGC CGAACGGACT GGCAACGATT ACGAGCCAAA TCCATTAGGA
AATCCATCGT CGATCATTCC TGAACGCTAC TCGATTGGTG AAACCTTGCG CACCTATCGC
TTGGCCATGG CTGCCACTGG CGAATACACC GCATTCCACG GTGGCACGGT CAATGGCGCG
ATGGCGGCAA TCGTCACCAG CATGAATCGG GTTAACGGAA TCTACGAACG CGATCTTTCA
GTGCGCATGC AATTAATCGC CAACAATGAT CTGATTGTGT ATACCAATGC GAGCAGCGAC
CCCTATACCA ACAATAGTGG TGGTACGATG CTTGGCCAAA ACCAAACCAA TTTGACCAAC
GTGATTGGCG GGGCCAACTA TGATATTGGC CACGTGTTCA GCACTGGCGG CGGTGGGGTC
GCTACTTTAC AATCAGTCTG TTCTTCAGGC AGCAAAGCGC GTGGGGTTAC TGGCTCAGGC
TCACCAGTTG GCGATGCCTT CGATGTCGAT TATGTCGCGC ACGAAATTGG TCACCAATTT
GGTGGTCTTC ACACCTTCAA TGGCTCAACT GGCAGTTGTA GTGGTGGCAA TCGTTCCAGC
AGCGCTGCCT ACGAACCAGG TAGTGGTACA ACCATTATGG CTTATGCCGG GATTTGTGGC
TCGGAAAACT TGCAACCAAA CAGCGACTTC GACTTCCACG TCAAGAGCTT AGAAGAAATT
TCAGCCTTCA TCACCACTGG CGGCGGCGCA ACCTGTGGCA CAACTCAAGC CACTGGCAAT
ACCCCACCAG TCGCCAATGC AGGCAGCGAT TACACCATCC CAGCCAATAC GCCATTCGAA
TTGACTGCTA GCGCTAACGA TGCTGAAGAC AATAGTTTGA CCTACGATTG GGAACAATAC
GATTTGGGTG CGGCCTCACC ACCAAACACC GATAACGGCA ATCGCCCAAT CTTCCGTAGC
TTCAATTCAA CCGCCAGCAA TGTGCGTACC TTGCCAAAAC TGAGCGATAT TTTGAACAAT
ACCACGACGA TTGGCGAATC GTTGCCAACT ACTAACCGCA ACCTGACCTT CCGCTTGACC
GTGCGTGATA ACCACGCTGG GGCTGGTGGT TATGGCTTGG ATACGGCGGT TCTGACTGTC
AACAATACTG CTGGACCATT CTTGGTAACT GCACCAAACA CGGCAATCGC CTGGACAGGC
GGCGCGAACG AGTCGGTCAC GTGGAATGTT GCCAACACCA CCGCTGCGCC AATTAGCTGT
GCTAATGTTG ATATTTTGCT CTCGAAAGAT GGCGGCACAA GCTTTGAAGC CTTGGTCAGC
AACACCCCCA ACGATGGCGA TGAAACTGTT GTTGCGCCAA ACGTTAATGC TGCTGCTGCG
CGGATCAAAG TGCGTTGTGC TAACAATATC TTCTTTGATA TTTCTAACGC CAACTTTGCG
ATCAACGGGG TCAATATCAC ACCAACTCCA GTTACACCAA CATTAACCCC AACCAATACT
CCCACTCGCA CGCCAACATT AACCCCAACT CAAACATCAA CCCCAACCCA AACCGCGACG
GCGACCCCAA CAGCTAGCCC AACCGTCAGC GTTACGCCGA CGGCGAGCGT TACACCAACC
CCTGAGAACT ATAGCGTTTA TCTGCCTGTC GCAATCAAAA ATTAA
 
Protein sequence
MRLGRSWMII VAVMIIALLS VITATAMVSP SVSSSDGLWQ NVAEQDIQQK GSREIIPVVY 
RTVALDLNLL QQHLRQVPQE AQTKVQKSGF MLDLPLPDGQ FGKFRVVESP IMAPELAAKF
PEIRTFLAQS VDQPATSARL DITPRGFHGM ILSESGRIFI DPYSRNDTAN YIVYDARNFV
ADPSKLAERT GNDYEPNPLG NPSSIIPERY SIGETLRTYR LAMAATGEYT AFHGGTVNGA
MAAIVTSMNR VNGIYERDLS VRMQLIANND LIVYTNASSD PYTNNSGGTM LGQNQTNLTN
VIGGANYDIG HVFSTGGGGV ATLQSVCSSG SKARGVTGSG SPVGDAFDVD YVAHEIGHQF
GGLHTFNGST GSCSGGNRSS SAAYEPGSGT TIMAYAGICG SENLQPNSDF DFHVKSLEEI
SAFITTGGGA TCGTTQATGN TPPVANAGSD YTIPANTPFE LTASANDAED NSLTYDWEQY
DLGAASPPNT DNGNRPIFRS FNSTASNVRT LPKLSDILNN TTTIGESLPT TNRNLTFRLT
VRDNHAGAGG YGLDTAVLTV NNTAGPFLVT APNTAIAWTG GANESVTWNV ANTTAAPISC
ANVDILLSKD GGTSFEALVS NTPNDGDETV VAPNVNAAAA RIKVRCANNI FFDISNANFA
INGVNITPTP VTPTLTPTNT PTRTPTLTPT QTSTPTQTAT ATPTASPTVS VTPTASVTPT
PENYSVYLPV AIKN