Gene Hlac_3661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3661 
Symbol 
ID7402483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012030 
Strand
Start bp420305 
End bp422323 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content59% 
IMG OID643710192 
Productpoly-gamma-glutamate biosynthesis/capsule biosynthesis protein 
Protein accessionYP_002567758 
Protein GI222481522 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATCTC TCTCTGGGTG CTCTCAGAGA ATTCAGCGGG TGATTCAGGA CGCTGCCGGC 
GACGAAGCAG CAGTCACCGG TACTGTCACC ACGGGCGGTG ACCCGTTATC GAACGCGTCA
GTGACCGCTT ATCGAAACGG CAGGGAAATC GCACGAGCCA CAACCGACGA TGATGGAACA
TACAACGTCT CTCTCGGTGG CTTCCCGGCT TGGGTTCGAT TTGACCACCC GGAGTGCAGC
TCAGTCACGA GGGCAGTCGC ACCAGGATCG GCAAGAAGTA TCAAACTGAA TTCGGGCGAG
GAGTCGGTCA GTTTAGCATT TGGCGGGGAC GTGATGTTCG GACGGCGGTA CTACGAGCCG
AGAGACGACC CACTCCGGTT CTATTACCGG TTACAACCGA CAGACCGTCG GGACTCTCAT
GACCGATTGC TTGACTCGGT CTCACCACTC TTCGGGGACG CAGATATTGC GTCGATTAAT
TTGGAGACGC CGCTGACGAC ATCAGAATGG AGGCACCCGT CAAAGGCGTT CGTTTTCACC
AGCCATCCTG TCGCTGCAGC GGCGATGGCC GACGCTGGAA TCGATTATGC GGCACTTGCG
AACACGCATG CCTTCGATGC CCTCACGCCA GGGCTTGAAG AGACGATTGA GTCTCTCGAC
AGGGTCGGAG TCGCCCATTC CGGCGCCGGT TCGGACTCCA CCACCGCCAT TGCTCCAGCC
ATTCTCGAAC GCGACGGGGT GACTGTCGGA TTCGTTTCGG TAACGACAAC AGCGGGCAGA
CAATACGAGC GCGATTGGGC GGCCGACGAG ACGACTGGGA CGTATACTGT CAATCGAGAA
GACGAAACGC TCACTGTTCG GGACAGTGCG GGGGTTGCCG ACGCGACGCC CGAAACGATT
CGTGCCGGCG TGGAGGCTGC GACCGACCAA GCGGACGTGG TCGTGACACA AATCCATGGC
GGGGAGGAGT ACCAGCGCAC GCCCACGCGG GAACTCCAGG ATTTGACCGA CACCGCGATC
GCTGCCGGCT CGGATCTGGT CGTGAACCAC CACCCACACG TGTCAGGGGG ACTTGAGACC
CGTGACGGCG CGCTCGTCGC GTGGTCGATG GGGAACCTCT TCTTTGACCA GAACCTCTGG
GCTACTTATC GATCGTTCAT CCTGCAGGTG ACAATCTCTC CCGACGGAAT ACAGTCGGCA
CGGGCGGAAC CGATCCTCAT TGAGGGTTAC ATCCCACGCG GGGTGACTGG ACCGCTCCGA
GACCGGCTGA CGTGGGAACT CGCGGGACTC TCGGACAATT CGTTCATGAT TACCGAGGAT
ACATTGGTAT ACCAGCCTGA CGACGAAAGG CCTACACCCG AACAACTGGC CCTTGACGGT
GGGGGCCAAC GTAGGGTTCG CGGGTGGGTT ACCGACTCCG ATGACTCGGT TCAACTTGGT
CGTGAGCGGT TCCTTACCGG GTCGTTCGAT GATCACGATG TTGATAGTGA CGCATACGAA
GGCACGCTGT GGCGCTACGG CCGTGAATCC CGTAGCAGCG ACCAACCTAT AGGGCGAGAT
GGGTCCGGAG GTATCGAACT AGTACGCGTT CAAGCAAACG AGAACCGAGC ACTATTCTCG
CCGTGGAACC GCCTGCCGGT CTCCAACAAG GAATTCACGC TGTCGGGATC ATACCGGACG
AACGCAGACG GAGAGCTTCG ACTGCTGGTC TCGTGGTACA ACGACACATC TGGAAGTTCG
TTCCAATCCC AAGAGATGTC ACTCGCGTCG ACGGAGCGTG AATGGACTGA CTTCTCACTT
GAATTAGAGC GGCCCGATGA GGCCACCCAT ATCGACGTCT TCGTGTTTCT GAGCCCACCG
GATGGCGTCG ATATCCTGCG TGCGGCGTTC GACACGCTGA GCCTCGTTGA GTGGGAGCCG
ACCGAGGTTG CCGGCGGCCG GCAGTTCGAT GTCATTCGGG GTTCGTCCGG AGCAACTGTC
CGTGTGATCC CTGTCGACGG TGAGGTGAGC TGGCAGTGA
 
Protein sequence
MASLSGCSQR IQRVIQDAAG DEAAVTGTVT TGGDPLSNAS VTAYRNGREI ARATTDDDGT 
YNVSLGGFPA WVRFDHPECS SVTRAVAPGS ARSIKLNSGE ESVSLAFGGD VMFGRRYYEP
RDDPLRFYYR LQPTDRRDSH DRLLDSVSPL FGDADIASIN LETPLTTSEW RHPSKAFVFT
SHPVAAAAMA DAGIDYAALA NTHAFDALTP GLEETIESLD RVGVAHSGAG SDSTTAIAPA
ILERDGVTVG FVSVTTTAGR QYERDWAADE TTGTYTVNRE DETLTVRDSA GVADATPETI
RAGVEAATDQ ADVVVTQIHG GEEYQRTPTR ELQDLTDTAI AAGSDLVVNH HPHVSGGLET
RDGALVAWSM GNLFFDQNLW ATYRSFILQV TISPDGIQSA RAEPILIEGY IPRGVTGPLR
DRLTWELAGL SDNSFMITED TLVYQPDDER PTPEQLALDG GGQRRVRGWV TDSDDSVQLG
RERFLTGSFD DHDVDSDAYE GTLWRYGRES RSSDQPIGRD GSGGIELVRV QANENRALFS
PWNRLPVSNK EFTLSGSYRT NADGELRLLV SWYNDTSGSS FQSQEMSLAS TEREWTDFSL
ELERPDEATH IDVFVFLSPP DGVDILRAAF DTLSLVEWEP TEVAGGRQFD VIRGSSGATV
RVIPVDGEVS WQ