Gene Haur_1225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1225 
Symbol 
ID5733118 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1417675 
End bp1418673 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content50% 
IMG OID641278365 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001544001 
Protein GI159897754 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACCG CATCGAATGT GGCAGGCAAA AGCCCTGCCC AGGCTCAAGA ATTTACGCGG 
CCTGTTCGGA GTCTTTGGAG CGATGCTTGG TTACGTCTGC GGCGCAATCG TTTAGCAATG
GCAAGTATTG TCTATTTGTT TTTGCTGGCG TTGGTAGCAA TTTTCGCACC AGTCATTGCC
CCGCACTCGC CTAGCCGCCC AACCTCAACC GAATTACGCG AACGAGGCAC CTATCGGCAA
GCTGCTTGGA TTGTTGATGA GAAAAACCCT AAACGCAGTG GGGTTTGGAA ATTTCCCTTA
GGAACAGATT CTGCTGGCGG CGATGTGCTC AGTCGCTTAA TCTATGGAAC GCGGGTTTCG
ATGGTTGTGG GCTTCATTCC CATGATCTTT ACCCTGACGA TCGGGATCAC GATTGGCTTG
GTTTCAGGTT TTGCCGGTGG CAAACTCGAT AGTTTGCTCA TGCGGTTTAC TGATATTGTC
TTTTCGCTGC CCGATATCTT GTTCTTTATT ATTGTGCAAA CGGCCTTCAG TCAAACCGCC
TTTGGCAAGA CCTTCAATGG TTTATTGTTG ATTTTCTTAT CATTCTCAGC GGTCAACTGG
GCTAGCGTTG CGCGTTTGGT GCGTGGCCAA GTGCTTTCTT TAAAAGAAAA AGAGTTTGTT
GAAGCAGCAG AGGCGATTGG GGTTCGGCGT GGCTCAATTT TATTTCGCCA TATTTTGCCC
AACACGCTCG CCCCAATTAT TGTGGCAGGT GCGTTTATTG TGCCAAGCGC GATTGTCACC
GAAGCAACCC TGAGCTTTTT GGGGATTGGC ATCCAGCCTG ATACCAACCC CAATAATCCG
TTCCCTACCA GCTGGGGCCA GATGATTTTG GAAGGTAAGT CGGCGATTGA TTCGCAACCA
TGGATTCTGA TCGCGTCGGC GATTGCAATT GCTTCAATTA CGATTGCTTT TGTGGCTTTG
GGCGATGGTT TACGTGATGC GCTTGATCCC CGCCAATAG
 
Protein sequence
MATASNVAGK SPAQAQEFTR PVRSLWSDAW LRLRRNRLAM ASIVYLFLLA LVAIFAPVIA 
PHSPSRPTST ELRERGTYRQ AAWIVDEKNP KRSGVWKFPL GTDSAGGDVL SRLIYGTRVS
MVVGFIPMIF TLTIGITIGL VSGFAGGKLD SLLMRFTDIV FSLPDILFFI IVQTAFSQTA
FGKTFNGLLL IFLSFSAVNW ASVARLVRGQ VLSLKEKEFV EAAEAIGVRR GSILFRHILP
NTLAPIIVAG AFIVPSAIVT EATLSFLGIG IQPDTNPNNP FPTSWGQMIL EGKSAIDSQP
WILIASAIAI ASITIAFVAL GDGLRDALDP RQ