Gene Hlac_1404 details

Gene Information

Locus tag: Hlac_1404
Symbol:
ID: 7400723
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1411586
End bp: 1412935
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 68%
IMG OID: 643708465
Product: major facilitator superfamily MFS_1
Protein accession: YP_002566062
Protein GI: 222479825
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2223] Nitrate/nitrite transporter
TIGRFAM ID:
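
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the gene length
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1411586, 1412935)
print(length)                     # 1350
print(protein_length_aa(length))  # 449
```

Both results agree with the Gene Length and Protein Length fields of this record.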


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.802678
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGCCGA CCGACCTGCT CTACGGCAAG TACCGGAACC TCATTCTCGC GACGGCCATG 
TTCAACCTCG GATTCGTAAT CTGGTTCTCG TTCGCGCCGT TCACCGGCGG TATCGCCGAG
GAGTTCGGGC TCTCAGTCCA ACAGCTCGGG GTCGTCGCGA GCGCGGCCAT CGTCGCCGTC
CCCCTCGGTC GCATCGTCAT CGGGCCGCTG ACGGATCGGT TCGGGGCGAA CGTCACGGGC
GGCGTCACGC TCGTCGTCGT CGGGACGTTC GCGATCATCA GCGCGTTCTC ACAGACGTAC
GAGGTGTTCA CCGCCTCACG GATCGTCGCG TCGCTCTCGG GGATCACGTT CGTCATCGGC
ATCCAGCACG TCGCCGAGTG GTTCGAGGAG GAGAACCTCG GGACCGCGGA AGGGATCTTC
GCCGGAATCG GCAACGCCGG GGCCGGGCTG GGCGCGTACT TCACCCTGCC GCGGATCTTC
GGTGAGGGAT ACGTCGACCC GATATTCGGC TCGGCGTTCC TCGCCACCTC GTCGAACTGG
CGGGCGGCGT TCTTCTACAC CGGCGCGCTC GCGGTGCTGC TCGGTGTCGT CTACTACGTC
TTCGGGGCCG CCGCGAAGAG CGAGGCGAAG CGGGAGGCGA CGAAGGCGGG CGTGAGCTGG
GAGCAGTGGC GGTTCATCGC GACCCGCTAC GGCGCGGTCG TCCTCTCCGT CGCGTACGTG
ATGTCGTTCG GCCTCGAACT CGCGATGAAC GGCTGGCTCG GGACCTACTA CCGCGAGGCG
TTCGGCCAGG GCGACATCGT GATCGCCGCG TCGTTCGCGG CGACGTTCTC GGTCGCCGCC
GGGCTCCTCC GGCCGATCGG CGGGTACGTC AGCGACGTGG TCGCGCGCAA CGAGCGGGAC
ATCCTCCCCG TCTTCGAGGG TGAGTACCGC AACCAGTGGA CGTTCGCCGC GCTCTCGTTC
GTCGTACTGT CGATGTTCGG CATGACGCTC GCCGGGCTGA CGGGGAACAT CTACGTCGCC
GTGGTCGCCG GCTTCCTCGT CGGCACCGGC TGCGCGTTCT CCGAGGGCGC GATCTTCGCG
CAAGTGCCCG CGATGTTCCC GGACAACTCC GGATCGGTGG CGGGGATCGT CGGCGGGATC
GGAACGGTCG GAGGCGTCGT CTACCCGCTC GCGTTCTCCG CCGCGTTCCT CCCGAACCTC
CACGTCGGCT ACGCGATCGT CGGCGCGTCG ATGATACCGA TCTTGGGGCT CGTGGCGTGG
GTGTTCCAGC CGAAGATCGC GGAGCGCGCG AACGACGACG GATGGTTCGT CTCCGGCGAG
CGGATCGTCG CGACCGGCGA CGACGACTGA
 
Protein sequence
MKPTDLLYGK YRNLILATAM FNLGFVIWFS FAPFTGGIAE EFGLSVQQLG VVASAAIVAV 
PLGRIVIGPL TDRFGANVTG GVTLVVVGTF AIISAFSQTY EVFTASRIVA SLSGITFVIG
IQHVAEWFEE ENLGTAEGIF AGIGNAGAGL GAYFTLPRIF GEGYVDPIFG SAFLATSSNW
RAAFFYTGAL AVLLGVVYYV FGAAAKSEAK REATKAGVSW EQWRFIATRY GAVVLSVAYV
MSFGLELAMN GWLGTYYREA FGQGDIVIAA SFAATFSVAA GLLRPIGGYV SDVVARNERD
ILPVFEGEYR NQWTFAALSF VVLSMFGMTL AGLTGNIYVA VVAGFLVGTG CAFSEGAIFA
QVPAMFPDNS GSVAGIVGGI GTVGGVVYPL AFSAAFLPNL HVGYAIVGAS MIPILGLVAW
VFQPKIAERA NDDGWFVSGE RIVATGDDD
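
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such text might be cleaned, checked for GC content, and translated is given below. It is written under stated assumptions: the codon assignments are those of the standard genetic code (which translation table 11 shares for elongation codons), the first codon is taken to be a valid start codon and is therefore read as methionine, and one trailing stop codon is dropped. All function names are hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS; the first codon is assumed to be a start codon
    and is read as M (an assumption, e.g. GTG in this record)."""
    seq = clean(seq)
    if not seq or len(seq) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    residues[0] = "M"
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence above.
print(translate_cds("GTGAAGCCGA CCGACCTGCT CTACGGCAAG"))  # MKPTDLLYGK
```

The printed residues match the first block of the protein sequence above. Passing the full gene sequence to gc_fraction gives a value that can be compared with the GC content field.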