Gene Information |
Locus tag | Haur_2640 |
Symbol | |
ID | 5734518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3385348 |
End bp | 3388926 |
Gene Length | 3579 bp |
Protein Length | 1192 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641279780 |
Product | chromosome segregation protein SMC |
Protein accession | YP_001545406 |
Protein GI | 159899159 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG1196] Chromosome segregation ATPases |
TIGRFAM ID | [TIGR02168] chromosome segregation protein SMC, common bacterial type |
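
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes a single trailing stop codon; the record states neither convention explicitly. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 3385348
end_bp = 3388926
gene_length_bp = 3579
protein_length_aa = 1192


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

# Both comparisons hold for this record under the stated assumptions.
lengths_agree = computed_length == gene_length_bp
proteins_agree = computed_protein == protein_length_aa
```

Under these assumptions, the reported 3579 bp and 1192 aa agree with the coordinates.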
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATCTCA AACGACTTGA AATTCAGGGA TTTAAAACCT TTGCCAATCG CACAGTGATT GAGTTTCCGC TAGGAGTGAC GGCGATTGTC GGGCCAAATG GTTCGGGCAA GTCCAACGTT ACCGATGCGA TTCGTTGGGT GCTGGGCGAA CAAAGTTTCT CGGCTTTGCG CTGTCGCCGC ACCGAAGATC TCATTTATAG CGGCGGTGGC AAACGTGCCG CCCAAGGTAT GGCTGAAGTC GCCCTAACGA TCGATAATAC CGATCGCACC TTGCCGCTCG ATTTCAACGA AGTCACAATC ACGCGCCGCT CGTTTCGCTC TGGCGAGAAT GAATATTTCT TGAATAAAAA TAAAGTTCGT TTGCGCGATA TTCAAGAAGC CACATCGCCC TTGGCTTCGA GTTATACCCT GATCAATCAA GGTTTGGTTG ATGCTGCCTT GACTTTACGT CCCGAGGAGC GCCGCAGCTT ATTCGAAGAT GCCGCCTCGA TTAGCCTTTC GGTCAGTAAA CGCGCCGAGG CCGAGCGGCG ACTCAAACAA ACCGAAGACA ATCTAGGCCG CATTTTGGAT ACCTTGGCTG AGATCGAGCC ACGTTTGAAG GTGCTGCGCC GCCAAGCGCG TGAGGCTGAG CAGGTTCATG AAATTGAAAC CGCCTTGCAG CAAGCCCTGC TGATTGCTTA TCGCCGCCAG TGGCAGGCCG CCCAAAGCTT GGTCGCTCAA GCCGAAATTG CTTTTGCTAA GGCCGAAAAA GTTCTAGCTC ACGCTCAAGT TGGCCAGGCT CAAGCCGAGC AACAACTTGG TGCATTACGC CAGCAGCGTG ATCAGCAGCA GCAATTGGTT GAGCAACAAC ACCAGCAATT GGCCCAATTT GAGCGTGAAT GGGAAGCCGT GGAGCGTGAA TGGGCGGTGT TGACTGAGCG CCAACAAGCG CTGATCACCC GTCGCGCTGA GGCTCACGAG CGTCAGCAAA GCCTAGAGCG AGAGCAAACC CAAGCCCAAG CGCGAGCGCT TGAGTTAGCT GAACAGGCTG CCCAAGCCCA AGCGGCCTTG ATTGCTCAGC GCCACGCCTT AACCACAATT GAGGCTGGCG AACAAGCGAC TATTGCCGCC CGCCGCGAAG CTGATCAAGC CCTCAAACAA GCCCAAGAAG CTGCTTTGCA AGCAACCAGA ACCTTGAACG AGGCTCAGCA ACGGGCCAAA TTGCAGCAAG AGCGTTGGCA AGCGTTGGCC AATGAATTAA CCACCAACCA ACAGCAACTT GCCGCTAGCC AAACGCGCAT TCAGCAAGCC CAAACCACCC TTGAACAAGC CCAACAGCTA TATCAACAGG CCGAAGCTGA GTTTAACCAA ACTCAAGCCA ATGAAGAGGC TGCCCAAAAA AGTTTGTTGG AAGCCCGTAA TCTGCGCCGT GAGCAAGAAG AAGCTGTGGC CTACGCCCGC CGTGAAGCCG ATGCCTTGCA TAGTCGCTTG GATGCCTTGC GCCGCACTGC CGCTGCCGGA GCTGGCATGT TTACTGGGGT TCGGGCGGCT TTGCAATGGG CCGAAAAGCA GCACCAGCAA TTTGCCGTGG TTGCCAGCGT GATTGAAGTG CCTGCTGAGT TAGAAACGGC CTTGGAAGTG GCGTTGGGTG CACGCTTGCA AAATATTATC GTGCCCAATT GGGAAGCCGC CGAAGCCGCG ATTGCTGAAC TCAAACGCAC TGATGCTGGC CGTGCTACCT TTTTGCCGCT CGATACGCTA CGCACGCCAC GCCCAAGCCG CATTCCCCAA GGCAAAGGGG TTTTAGGGCT TGCCAGCGAG 
TTGGTGAACT ATGCCGAGGC CTATGAACGG GCGGTGCAGC ATTTGCTTGG GCGCACAATT GTGGTCGAAG ATTTAGCCAC TGCACGGCGA GTTTTAGCCG AACTTGATGG TGCTTGGACG ATTGTGACGC TTGGTGGCGA ACAAGTTGGC TCATCGGGGG CGATGACTGG TGGCGCTAGA ACTCGTGAGG CTGGTACGTT GCGCCGCGAA CGAGAATTGC GTGAATTGCC CGCCCAACTT GAGGCCGCCC AACAACAACT CGCCGAGCAT GAACAACAGC TCAAGCAGGC GATTGCCGCA ATTGGCAATG CTGAAAAAGC AGTTCGTGAG GCCGAGCAAG CCCGCCGCAG CAACCGCACT ACCATGGAAA AAGCTCGTGA TAGTTTGGCG CAACGCCAAC GTGGCCTTCA GCAAATTGAG CAAGAGCAAC AATGGCAACA ACTGCGCAGC AACAATGTTC AACAAGAGCA AGCCCGCCTT GCCGAGCAAC TTGCCGCCAG CCAAACCGCA ATTGAGCAGG CGCAAATCCA TGGTGCTGAG CATGAACGTT TGGTTGTTGC TGCCCGTGAG CAAGCCGAAA CCGCAGCGGC GGCCAGTCGG GCTAGCGAAG AACGGGTTGC GGCAGCACGA GCAGCGATTG TCTCCAGCGA TGCAACCTTG CAGGCGACCC AGCGTGCCCA ACGCGATCAG CAACAGGTGA TTGATGGCTT GGTGCGGCGG AGCCGTGAAG ATCAGCAACG CCAGCACGAG TTGGTTGAAC AATTGGCCTT GGCTGAAGCC CAAAGCGTGC TTGTAGCCGA ACGCCGTCAG CAAGCCCAGC TGCAACGTGA GGCCGTTTAT GCCGAGCAAG CCCCGTTGCT CGAACGCTTG AATCAGGCAA ATTTGGCGCT GCAACACGCC GAATTGCAAG AGCGGGCAGC CTCGCAGGCC TTTGTTCAAG CGCAAACCAG CCTTAGTCGG GCTGAAAGCC AATTGGCGAC AGCCAACACT CGCCGCGACC ATGTGTGGGA ACGTTGCGCC GAAGAGAATA TTGATATTGA GCAATTGGAT TTGCGCCAAA CGCCTGAGGT TGAGCAGGCT GAATTATCGG CCCAAGCCTT GAACGAGCAA ATTGATCAGT TGCGCAACAA ACTGCGGCGA ATGGGCACGA TTAACCCACT TGCGCCGCAA GAATATGCTG AGCTTGGCGA ACGCAATCAA TTCTTAACGG GCCAAGTCAG CGATATTCGC CAAGCGGCTG ATGGCCTGCG TGAGTTGATT AACGAGCTTG AAACGGCCAT GAATAGCCGT TTTGCCCAAA CATTTAGTGC CGTGGCCGAA GAATTTAGCC TAGCCTTTAC CCGTTTGTTT GGTGGTGGTA CAGCCCAGTT GATTTTGAAC GACCCCAATA GCAGCGAAAG TGGCATTGAT ATTATTGCCC AACCACCAGG CAAACGCCGC CAACCGCTCT CCTTGCTCTC TGGGGGCGAA CGTTCATTGA CCGCCGTGGC ACTGTTGGTG GCCTTGTTGA AAGTTAATCC CACGCCGTTT TGTGTGATGG ACGAAGTTGA CGCAGCCCTG GATGAAGCCA ATGTTGTGCG ACTTCGTGAA CAATTAATTG AGATGAGTCA ACAGACCCAG TTTGTCTTGG TGACGCATAA CCGTGGTACT GTGGAAGCGG CTTCAACCTT ATATGGCGTG ACGATGAATC CTGATGGAGC TTCCAAAGTG TTGTCGATTC GCCTTGATCA GTTGGTCGAT GATGGTGGGG CAGTGCGAAT TGTTGAGACA GTTGGTTAA
|
Protein sequence | MYLKRLEIQG FKTFANRTVI EFPLGVTAIV GPNGSGKSNV TDAIRWVLGE QSFSALRCRR TEDLIYSGGG KRAAQGMAEV ALTIDNTDRT LPLDFNEVTI TRRSFRSGEN EYFLNKNKVR LRDIQEATSP LASSYTLINQ GLVDAALTLR PEERRSLFED AASISLSVSK RAEAERRLKQ TEDNLGRILD TLAEIEPRLK VLRRQAREAE QVHEIETALQ QALLIAYRRQ WQAAQSLVAQ AEIAFAKAEK VLAHAQVGQA QAEQQLGALR QQRDQQQQLV EQQHQQLAQF EREWEAVERE WAVLTERQQA LITRRAEAHE RQQSLEREQT QAQARALELA EQAAQAQAAL IAQRHALTTI EAGEQATIAA RREADQALKQ AQEAALQATR TLNEAQQRAK LQQERWQALA NELTTNQQQL AASQTRIQQA QTTLEQAQQL YQQAEAEFNQ TQANEEAAQK SLLEARNLRR EQEEAVAYAR READALHSRL DALRRTAAAG AGMFTGVRAA LQWAEKQHQQ FAVVASVIEV PAELETALEV ALGARLQNII VPNWEAAEAA IAELKRTDAG RATFLPLDTL RTPRPSRIPQ GKGVLGLASE LVNYAEAYER AVQHLLGRTI VVEDLATARR VLAELDGAWT IVTLGGEQVG SSGAMTGGAR TREAGTLRRE RELRELPAQL EAAQQQLAEH EQQLKQAIAA IGNAEKAVRE AEQARRSNRT TMEKARDSLA QRQRGLQQIE QEQQWQQLRS NNVQQEQARL AEQLAASQTA IEQAQIHGAE HERLVVAARE QAETAAAASR ASEERVAAAR AAIVSSDATL QATQRAQRDQ QQVIDGLVRR SREDQQRQHE LVEQLALAEA QSVLVAERRQ QAQLQREAVY AEQAPLLERL NQANLALQHA ELQERAASQA FVQAQTSLSR AESQLATANT RRDHVWERCA EENIDIEQLD LRQTPEVEQA ELSAQALNEQ IDQLRNKLRR MGTINPLAPQ EYAELGERNQ FLTGQVSDIR QAADGLRELI NELETAMNSR FAQTFSAVAE EFSLAFTRLF GGGTAQLILN DPNSSESGID IIAQPPGKRR QPLSLLSGGE RSLTAVALLV ALLKVNPTPF CVMDEVDAAL DEANVVRLRE QLIEMSQQTQ FVLVTHNRGT VEAASTLYGV TMNPDGASKV LSIRLDQLVD DGGAVRIVET VG
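
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal Python sketch, not part of the original record. It assumes the sequence is given as plain A/C/G/T letters in space-separated blocks, as displayed above, and it ignores any other character. The function name `gc_percent` is hypothetical, and the example input is only the first two 10-base blocks of the gene sequence, not the full gene.

```python
def gc_percent(sequence):
    """Percentage of G and C among the A/C/G/T characters in the input.

    Spaces, line breaks, and any other characters are skipped, so the
    blocked layout used in the record can be passed in unchanged.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence, used only as a small example.
first_blocks = "ATGTATCTCA AACGACTTGA"
first_blocks_gc = gc_percent(first_blocks)
```

Passing the full gene sequence string to the same function would give the gene-wide value for comparison with the reported GC content.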