Gene Hhal_1966 details

Gene Information

Locus tag: Hhal_1966
Symbol:
ID: 4710461
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2162907
End bp: 2164172
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 66%
IMG OID: 639856439
Product: protein of unknown function DUF395, YeeE/YedE
Protein accession: YP_001003532
Protein GI: 121998745
COG category: [R] General function prediction only
COG ID: [COG2391] Predicted transporter component
TIGRFAM ID:
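
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so, but the listed Gene Length agrees with that reading) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2162907, 2164172)
print(length)                     # 1266
print(protein_length_aa(length))  # 421
```

Both results match the Gene Length and Protein Length fields of the record.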


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.629772
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCTTCG ATTCCTTTGT CGCAGCGCAC TGGACGATGC TCGGCACCGT CTTCGCCATC 
GCGGTGTTGC TCGGTGCCGT CGTCAACAAG AGCAACTTCT GCACGATGGG CGCCGTCTCC
GACATCGTGA ACATGCAGGA CTGGCAGCGG ATGCGCATGT GGATCCTGAT CATCGCCGTG
GCGATCCTCG GCGTAGGGCT GCTCGAGCCC CTGGGGCTGA TCAACGCCGA CGAGAGCATG
CCGCCGTACC GCGCCTCCGA TTTCGCCTGG GCCGGCTACC TCCTCGGCGG CCTGCTTTTC
GGCATCGGCA TGACCCTGGG CAGCGGATGC GGCAACAAAA CGGTGGTGCG CATCGGCACC
GGCAACATCA AGTCGCTGTT CGTCGCGGCG GTGCTCGGCA CGGTCGCCTT CTTCATGACC
AACCCCCTGC CGCTGATCGA CGCCTCCCTG CGCGATCTGT TCTTCGGCTG GGTCAACGCC
ACCGCCATCT CCCACAGCCA CGGCCAGGAT CTGGGCAGCC TGATCGCCGG CGAGGCGGGG
CCGTGGGTGC GCCCCCTCCT GGCCCTGCTC ATCGGCGGTG CCCTGCTCTA CGCCGTTCTG
CGGGTCGCCG GCTTCCGCCA GGATCGCAAC GCCGTCTCTG GGGCACTGAT CATCGGCGCC
TGCATCGTTG CGGTGTGGAC GGTGACCAGC AACGTGTACG TGGCCGACGA GACGGGTCAA
CGCGACACCC TCCAAACCTA CGCCACGGAC TGGGACTTTC ACCACCCGGA CACCGATGCG
GGCCGCCCCG AAAGCACCCG CTGGCTGGCA CCGCAGGGGG TCAATTTCGT CGGCCCGCTG
GTACAGAGCA CCCAGTACAC CGCCAGCGGC TTCAATCCGG GGCTGATCAC CGTCGGTGTC
ATGGTGATCG GCGGCGTGAT CGTCGGCTCA TTCCTCTGGG CCCTGATCAG CCGCAGCTTC
CGCTTCGAGT GGTTCGCCGA CCGACAAGAC TTCAACCGAC ACCTCACCGG GGGTGTCCTC
ATGGGGATCG GCGGCCCGCT GGCCATGGGC TGCACCTTCG GCCAGGGTAT CACCGGCATG
TCCACGCTGG CCCTGAGCGC ACCGCTGGCC CTGGGCGGGC TGATCCTCGG CAGCGCCCTG
ACCATGAAGA TCCAGTACTA CAAGCTCCTC TACGAAGACG AGGCCACCTT TAGCAAGGCC
CTGGTCACCG GCCTGGTGGA CCTTCGCCTG CTTCCGGCGT CGCTGAGGCA GCTCGATGCG
CTTTGA
 
Protein sequence
MVFDSFVAAH WTMLGTVFAI AVLLGAVVNK SNFCTMGAVS DIVNMQDWQR MRMWILIIAV 
AILGVGLLEP LGLINADESM PPYRASDFAW AGYLLGGLLF GIGMTLGSGC GNKTVVRIGT
GNIKSLFVAA VLGTVAFFMT NPLPLIDASL RDLFFGWVNA TAISHSHGQD LGSLIAGEAG
PWVRPLLALL IGGALLYAVL RVAGFRQDRN AVSGALIIGA CIVAVWTVTS NVYVADETGQ
RDTLQTYATD WDFHHPDTDA GRPESTRWLA PQGVNFVGPL VQSTQYTASG FNPGLITVGV
MVIGGVIVGS FLWALISRSF RFEWFADRQD FNRHLTGGVL MGIGGPLAMG CTFGQGITGM
STLALSAPLA LGGLILGSAL TMKIQYYKLL YEDEATFSKA LVTGLVDLRL LPASLRQLDA
L
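
The gene and protein sequences above are related by the genetic code named in the Translation table field (table 11, the bacterial, archaeal and plant plastid code). A minimal sketch of how one might re-derive the protein and the GC content from the gene sequence follows. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; since this gene begins with ATG, that difference does not matter here. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the 10-base blocks are assumed to be display formatting only.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino acid assignments (it differs only in permitted start codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to display 10-base blocks; uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGTCTTCG ATTCCTTTGT CGCAGCGCAC"))  # MVFDSFVAAH
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed in the record, and `gc_percent` can be compared against the GC content field.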