Gene HS_1014 details

Gene Information

Locus tag: HS_1014
Symbol:
ID: 4240507
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 1118086
End bp: 1119444
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 39%
IMG OID: 638104570
Product: permease
Protein accession: YP_719225
Protein GI: 113461157
COG category: [R] General function prediction only
COG ID: [COG2056] Predicted permease
TIGRFAM ID:
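
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 1118086
END_BP = 1119444
GENE_LENGTH_BP = 1359
PROTEIN_LENGTH_AA = 452


def span_length(start_bp, end_bp):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Number of residues, assuming a CDS that ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Compare the derived values with the values listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1119444 - 1118086 + 1 = 1359 bp, and 1359 / 3 - 1 = 452 aa, which agrees with the listed gene and protein lengths.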


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.747204
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATATTAA CTAACCCAGT AGTAATTTCC ATTATTGTGT TACTCGCCTT AAGTCTTATG 
CGAATTAATG TTGTTATGGG ATTAGTTATC TCAGCATTAG TTGCCGGGCT ATGTAGCGAT
TTAACCATTA ATGAAACAAT AAAAGCGTTC ACAGGCGGTT TAGGCGGGGG GGCTGAAGTT
GCAATGAACT ATGCAATTTT AGGTGCTTTT GCTATTGCCA TCTCTAAATC AGGTATTACC
GATCTACTTG CTTTTAAAGT AATTAAACGC TTAGGTAAGT CGCCAACAGG TAATTCAATG
GCTGGATTTA AGTACTTTAT TCTTGCCATA CTAGCGTTAT TCTCAATTTC TTCGCAAAAC
TTATTACCAG TACACATCGC ATTCATTCCA ATTGTTATTC CCCCACTCCT TTCAATTTTT
AATAAACTAA AAGTTGACCG TCGTGCGGTT GCTTGCGTAT TGACATTTGG ATTGACTGCA
ACCTACATGC TTCTCCCTGT CGGTTTCGGT AAAATTTTTA TTGATAGTGT TTTAGTAAAA
AATATTAATC AAGTAGGTGC TTCACTCGGC TTAAAAACCA GTGTAGCAGA AGTCTCTTTG
GCAATGGCAA TTCCGGTTAT CGGTATGGTT ATCGGTTTAT TAACCGCAGT TTTCGTCACT
TATCGTAAAC CTCGAGAGTA TGTTTCAGCC GGTTTAAATG CTACAACTGA AGAAATCGAA
AAGCACATTG CAAATATTAA GCCCTTCCAT GTGATTGTAA GTGCGGTCGC AATTGTACTC
ACTTTCGCTT TACAACTGAT AACCAGTTCA ACAATTATCG CTGGTCTTGT CGGTCTCATT
ATTTTCGCAG TATTTGGTAT TTTTAAACTA AAAGAAAGCA ATGATATTTT CCAACAAGGC
TTAAAATTAA TGGCAATGAT CGGTTTCGTA ATGATTGCCG CTTCGGGCTT TGCTGCGGTA
ATTAATGCAA CTGGCGGTGT AACAATTTTG GTGGATAATT TCAGCCAAGG CTTGGGTGCA
GAAAATAAAG GTTTAGCTGC ATTTCTTATG TTGCTTGTAG GTCTATTTAT TACCATGGGA
ATCGGTTCAT CCTTTTCAAC TGTACCAATT ATTACATCTA TTTATGTGCC GCTTTGTCTT
TCCTTTGGAT TCTCACCGTT GGCAACTGTT GCAATTATCG GTGTTTCTGC CGCACTGGGC
GATGCTGGCT CACCTGCCTC AGACTCAACT TTAGGTCCGA CATCCGGCTT AAATATGGAC
GGTAAACATG ATCATATTTG GGATTCTGTT GTACCAACAT TCATCCACTA CAATATTCCG
CTCATTCTTT TTGGTTGGAT TGCGGCAATG TTCCTTTAA
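
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be computed. The following is a minimal sketch of that clean-up, not the method used by the source database; `clean_sequence` and `gc_fraction` are hypothetical helper names. Passing the full sequence text to these helpers gives values that can be compared against the listed Gene Length and GC content fields.

```python
def clean_sequence(raw_text):
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(raw_text.split()).upper()


def gc_fraction(sequence):
    """Fraction of bases in the sequence that are G or C."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First row of the gene sequence above, used here only as sample input.
first_row = "ATGATATTAA CTAACCCAGT AGTAATTTCC ATTATTGTGT TACTCGCCTT AAGTCTTATG"
joined = clean_sequence(first_row)
print(len(joined))
print(round(gc_fraction(joined), 2))
```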
 
Protein sequence
MILTNPVVIS IIVLLALSLM RINVVMGLVI SALVAGLCSD LTINETIKAF TGGLGGGAEV 
AMNYAILGAF AIAISKSGIT DLLAFKVIKR LGKSPTGNSM AGFKYFILAI LALFSISSQN
LLPVHIAFIP IVIPPLLSIF NKLKVDRRAV ACVLTFGLTA TYMLLPVGFG KIFIDSVLVK
NINQVGASLG LKTSVAEVSL AMAIPVIGMV IGLLTAVFVT YRKPREYVSA GLNATTEEIE
KHIANIKPFH VIVSAVAIVL TFALQLITSS TIIAGLVGLI IFAVFGIFKL KESNDIFQQG
LKLMAMIGFV MIAASGFAAV INATGGVTIL VDNFSQGLGA ENKGLAAFLM LLVGLFITMG
IGSSFSTVPI ITSIYVPLCL SFGFSPLATV AIIGVSAALG DAGSPASDST LGPTSGLNMD
GKHDHIWDSV VPTFIHYNIP LILFGWIAAM FL