Gene Hlac_0218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0218 
Symbol 
ID7402147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp235976 
End bp237970 
Gene Length1995 bp 
Protein Length664 aa 
Translation table11 
GC content74% 
IMG OID643707281 
Producttype II secretion system protein 
Protein accessionYP_002564893 
Protein GI222478656 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1955] Archaeal flagella assembly protein J 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.444432 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0181412 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGGG AGCCACAGAG AGGAGCCGAC GCCGCGGGAG GCGGCCGCGC CCTCGACGCG 
GCCGAGGCCG ACGGGTTCAC GCGGACCAGC GGCTCCTCGG TGGTCGACCG CGTGCTGTAC
GCTCTGTTCG CACGCCACGC GAGCGACCGC CGGCACGACG CCGACCGGAA GCGGTACCGT
GGGACTGCGC TCGACACGGG GTTCGAGACG CATTTGGCGC GGGTGTACGG GCTCTCGTGG
CTCGTCTGCG TTGCCGTCAT CGTCCCGGCG CTGCTTGTCG CCGCGTCGAC CGTGCCGGGG
TTACTCGCGG CGGCCGACGC CTGGATCGGC GCGCGTTCGC CGGTCCCGAT CGGGTCGCTG
TCGCCCGAGC GCGTTGCCGT CGTCGCCGCC GGCCTCACCG GGTTCCTCGC GAAGCGCGCA
ACGGTGCGGC TCGGTGGGCT CCACCTCCGG TGGCTCACGG CGACGCGCCG GACCGACATC
GAGCGGACCC TCCCGCCCGC GGTCCGCTAC CTCGAACTAC TCGCGGCCGG GAGCGACGGC
CCCCGGAAAA TGGTCGAGAA GGCCGCCGCG AACGACGCGT ACGGCGGGAC CGCCACCTCG
CTACGGAAGG CGCTCAACGC CGCGCGACTC GCCGGGAGCC TCGACGAGGG ACTGCGTCGG
GTCGCCCGCG ACACGCCCTC GCGAGAGCTG CTCGCGCCGT TCCTCCTGAA GTTCCGCAAG
CACGCGGCGA CTGGCGACGC GGCGCTCGCG GAGTACCTCC GGACGGAGAG CCGGATGCTC
GCGCACCGGC AGGACCGGGC CCGCAAGCGC GCCCGGCGAT TCCTGGAGCT TCTCACCGAA
CTGTTCGTCG TCGTGTTGGT GTTGCCGGCG CTGCTCGTCA TCGCGGCGAC CGCGCTCACG
GTCGTCGTCC CCGAGCTGCT CCCGCCGGTC GACACGCCGG TCGGCGTGGT GCCGACGCGG
GCGGTCGTCC TCTACGGCGC CGTCGTCTTT CTCGTTGCTA TCGGCCTCGT CGGTGCGGTC
GCGGTCGGCA CGCTCCGCCC GCCGAACCAG CGCGCGAGCT ACGACTTGCC GGCGTTCCCC
CGCAAAATCC TCGCCTCGGC CGGGCGCAAT CCGTCGAGCG CCGCGGTCGT CTGCGCGCCG
CCCGCGGCGC TGCTCGCGGT CGGGCTCGCG TTCGCGGGGT ACACTCTCGT CAACGTCGCG
CTGCTCTCGT ACGCGGCGTT CGCGGTTCCC GTCGGCGCCG TCGCGGGGAG ACGCACCCGG
ATCGACGACG CGAAGGACCG CGAGCTGGCG GACTTCGTCC ACGCCGTCTC GGGACACGTC
GCGCAGGGGC GGCCGTTCCC CGCGGCGGTC GAGGCGGTCG CGCGCGACGT GGACCTCGGC
GTGCTCGACG ACGACGTGGC CGACCTCGCG TTCGCGCTCC GGTCGACGAC CGCGACGGGG
GGTGCGGGGA GTACGGAGCG TGCGAGAGAC ACGGGGAGTA CGGGTGCGTC GGCACAGTCG
CCCGGCGTCC GCGCGGCCGC GATCGATCGG TTTGTCCAGC GCGTCGGAAC GCCGCTGGCC
GAGGGGACAC TCGGGCTCGT CACCGACGCC TTAAACGCCG GGAGCGACGC CGACGCCGTC
TTCGAGACGC TTCGGATCGA GGTCGGCCGG CTCTACAGCG AGCAGCGCGC GCTCCGCTCG
TCGATGCACC CCTACGTCGC GGTCGGCTGG GCGGCGGCCG CGCTGGTCGC GGCGGTCGTC
GCCGTCGTCA ACACGCAGGT GATCGACGCC GCGCACCTCG CCGACCTCGC CGGGGCAACC
GATTTCGTGG CCGAGCCGGA GACGGTGCTT CCGGAGCTGG AGCGCTTCCG GCTGTACGTC
GTCACGCAGG CGACGATGCT CGCCTCGGGC TGGTTCGCCG GCACCGCCTC CCGCGGACGC
TACGCGGCGC TGTTCCACTC CGGGCTGCTC GTGGCGATCT GCTACGCGGT GTTCACCGCG
GGCGGGCTGG TGTGA
 
Protein sequence
MSGEPQRGAD AAGGGRALDA AEADGFTRTS GSSVVDRVLY ALFARHASDR RHDADRKRYR 
GTALDTGFET HLARVYGLSW LVCVAVIVPA LLVAASTVPG LLAAADAWIG ARSPVPIGSL
SPERVAVVAA GLTGFLAKRA TVRLGGLHLR WLTATRRTDI ERTLPPAVRY LELLAAGSDG
PRKMVEKAAA NDAYGGTATS LRKALNAARL AGSLDEGLRR VARDTPSREL LAPFLLKFRK
HAATGDAALA EYLRTESRML AHRQDRARKR ARRFLELLTE LFVVVLVLPA LLVIAATALT
VVVPELLPPV DTPVGVVPTR AVVLYGAVVF LVAIGLVGAV AVGTLRPPNQ RASYDLPAFP
RKILASAGRN PSSAAVVCAP PAALLAVGLA FAGYTLVNVA LLSYAAFAVP VGAVAGRRTR
IDDAKDRELA DFVHAVSGHV AQGRPFPAAV EAVARDVDLG VLDDDVADLA FALRSTTATG
GAGSTERARD TGSTGASAQS PGVRAAAIDR FVQRVGTPLA EGTLGLVTDA LNAGSDADAV
FETLRIEVGR LYSEQRALRS SMHPYVAVGW AAAALVAAVV AVVNTQVIDA AHLADLAGAT
DFVAEPETVL PELERFRLYV VTQATMLASG WFAGTASRGR YAALFHSGLL VAICYAVFTA
GGLV