Gene Haur_3500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3500 
Symbol 
ID5735361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4407864 
End bp4408931 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content49% 
IMG OID641280647 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001546264 
Protein GI159900017 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0791674 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGTAT ATCTCATTCG GCGCTTGTTC CAAGCGATTC TCGTCCTTAT TTTATCGTCA 
GCAGTGATTT ACTCGCTGTT TGCCCTGGCT CCTGGTGGCC CACTTGAGGA GCTAACCCAA
GTTACCGACC CCAAGAATCG GCCTAGCCCC GAAGATATTC AACGTCAAAT CAAGTTGCTA
GGGGCCGATA AGCCTTGGTT CTTGTGGTAT CCAACTTGGC TCGCTGGTGA TACCTGGATG
GATAAAATTG GGTTTGAAGA ATATCAGGGC GAACGCAAGG GGATCTTGCG CTGGGATTGG
GGTACATCGT GGAAGTTTCA ACGTAATAAG CCAGTCTTGG AGATTATTGG CGATAAATTG
CCCGATACCC TGTGGTTGAT GATTTCATCA ACAATTATTT CGTTGGTGCT GGGCATTCCG
CTGGGGGTTT TCTCAGCAGT GCGCCAATAT TCTTTTTTTG ATTATGTGTT GACCACATTT
AGCTTTATTG GCTTATCACT GCCGGCCTTC TGGTTTGGTT TGTTGATTAT CGCGGTATCG
CTGTGGTTTA AACGCAATGG CTGGTTCTAC TTCCCCGCTG GCGATATTCT GGCCCTGCGT
AATTACGAGG TTCCGATTCT TGGCACGGTC GTCGCTGGCT CGTTGCTTGA TCGGGTGATG
CACTTGGTTA TGCCTGTTAC GGTGCTTTCA ATGCTCAACT TGGCCAACTG GAGCCGCTTT
ATGCGGGCGA GTATGCTTGA AGTGTTGAGC CAAGATTATG TGCGGACTGC CCGCGCTAAA
GGGGTCAAAG AACGCGTCGT GATCTACAAG CATGCCTTCC GCAATGCCTT GATTCCATTG
ATCACGATCA TCGTCTTTGC GATTCCTGGG GTGTTTGGTG GCGCACTGTT TACCGAAACA
GTCTTTAATT ATAAAGCGCT CGGCTTTACC TTTATTAGCG CTCTGAACCT CAAAGATTAT
CCTTTGGCGA TGGCCTTCTT GCTGATTTCG TCGATCTTGT TGGTGTTTGC GACGTTGCTG
GCGGATGTGC TCTATACCAT TGTTGACCCA CGAATTCGAC TTGACTAG
 
Protein sequence
MTVYLIRRLF QAILVLILSS AVIYSLFALA PGGPLEELTQ VTDPKNRPSP EDIQRQIKLL 
GADKPWFLWY PTWLAGDTWM DKIGFEEYQG ERKGILRWDW GTSWKFQRNK PVLEIIGDKL
PDTLWLMISS TIISLVLGIP LGVFSAVRQY SFFDYVLTTF SFIGLSLPAF WFGLLIIAVS
LWFKRNGWFY FPAGDILALR NYEVPILGTV VAGSLLDRVM HLVMPVTVLS MLNLANWSRF
MRASMLEVLS QDYVRTARAK GVKERVVIYK HAFRNALIPL ITIIVFAIPG VFGGALFTET
VFNYKALGFT FISALNLKDY PLAMAFLLIS SILLVFATLL ADVLYTIVDP RIRLD